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BACKGROUND OF THE INVENTION 

10 

Field of the Invention 

The present invention relates to a -motion vector calculating method 
suitable for performing an encoding process corresponding to for example 
MPEG (Moving Picture Experts Group) 2 method by software process. 
1 5 The present invention also relates to a record medium on which a motion 
vector calculating program has been recorded. Description of the Related 
Art 

As a highly efficient compressing method for a picture, MPEG2 
method has become common. In the MPEG2 method, a video signal is 
20 compressed and encoded by a motion compensation predictive encoding 
method and DCT (Discrete Cosine Transform) method. 

In the MPEG2 method, three types of pictures that are an I (Intra) 
picture, a P (Predictive) picture, and a B (Bidirectionally Predictive) picture 
are transmitted. For an I picture, DCT encoding process is performed 
25 with pixels of the same frame. For a P picture, with reference to an I 
picture or a P picture that has been encoded, DCT encoding process is 
performed using motion compensation predicting process. For a B 
picture, with reference to I pictures or P pictures that precede and follow 
the B picture, DCT encoding process is performed using motion predicting 
30 process. 

For a P picture or a B picture, an intra-MB (intra-macro block) 
encoding process or a an inter-MB (inter-macro block) encoding process 



may be performed for each macro block ("MB"). The determination of 
whether to apply an intra-MB process or an inter-MB process is made on a 
macro block basis. 

Fig. 1 is a block diagram showing an example of the structure of a 
5 conventional MPEG 2 encoder. Referring to Fig. 1 , picture encoding 

apparatus 100 includes frame buffer 102, motion detecting portion 1 14 
and controlling portion 1 18. Picture data, in the form of a component 
digital video signal, are supplied to input terminal 101. The component 
digital video signal is composed of a luminance signal Y and color 
10 difference signals Cb and Cr. The digital video signal is supplied from 
input terminal 101 to frame buffer 102. Frame buffer 102 temporarily 
stores the digital video signal. Frame buffer 102 has a storage capacity 
I for at least three frames of pictures of a current picture, a past reference 

i picture and another picture. Frame buffer 102 outputs picture data to 

i 15 motion detecting portion 1 14 and calculating portion 104 at 
m predetermined times under the control of controlling portion 118. 

Motion vector detecting circuit 1 14 obtains a motion vector 
1 between a reference picture and a current picture using data stored in 

U frame buffer 102. A motion vector ("MV") is obtained for each macro 

1 20 block. Each macro block is composed of, for example, 16x16 pixels. 
The obtained motion vector MV is supplied through controlling portion 
1 18 to variable length code encoding circuit 1 10, to motion compensating 
circuit 1 20 and to calculating portion 1 04 at predetermined times under 
the control of controlling portion 118. 
25 Controlling portion 1 1 8 determines a macro block type for the 

encoding process with motion vector MV and motion residual information 
AD received from the motion detecting portion 1 1 4. Controlling portion 
1 1 8 determines whether the current macro block is an inter-macro block 
or an intra-macro block corresponding to, for example, the picture type. 
30 An inter-macro block is a macro block that is motion-compensated jwith a 
motion vector MV and encoded with a residual. In contrast, an intra- 
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macro block is a macro block that is simply encoded without moving 
components. 

Controlling portion 118 generates control information that causes 
switches 130 and 1 16 to operate corresponding to the determined macro 
5 block type. In addition, controlling portion 118 supplies motion vector 
MV received from motion detecting portion 1 14 to motion compensating 
portion 120. 

Picture encoding apparatus 1 00 also includes DCT process portion 
106, quantizing process portion 108, variable length code encoding 

10 portion 1 10, and buffer 1 12. Calculating portion 104 receives a picture 
signal from frame buffer 102. DCT process portion 106 performs a DCT 
{Discrete Cosine Transform) process for picture data. Quantizing process 
portion 1 08 quantizes a DCT coefficient received from DCT process 
portion 1 06. Variable length code encoding portion 1 1 0 compresses a 

15 DCT coefficient received from quantizing process portion 108 with 
variable length code. Buffer 1 12 stores picture data received from 
variable length code encoding portion 1 10. 

DCT process portion 1 06 performs a two-dimensional DCT process 
for each macro block of (8 x 8) pixels of picture data received from 

20 calculating portion 207. DCT process portion 106 supplies a DCT 
coefficient to quantizing process portion 108. 

Quantizing process portion 1 08 quantizes a DCT coefficient 
received from DCT process portion 106 with a quantizing scale that varies 
corresponding to each macro block. Quantizing process portion 1 08 

25 supplies the quantized DCT coefficient to variable length code encoding 
portion 1 10 and to inversely quantizing process portion 126. 

Variable length code encoding portion 1 10 receives a DCT 
coefficient from quantizing process portion 1 08 and a motion vector M V 
from controlling portion 118. With such information, variable length code 

30 encoding portion 110 performs an encoding process. Variable length 
code encoding portion 110 performs an encoding process with variable 
length code corresponding to MPEG syntax and performs a header 
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process, a code generating process, and so forth so as to generate picture 
data. Variable length code encoding portion 1 10 supplies the encoded 
picture data to buffer 112. 

Buffer 1 1 2 stores picture data received from variable length code 
encoding portion 1 1 0 and outputs the picture data as a bit stream at 
predetermined time intervals under the control of controlling portion 1 1 8. 

in addition, the picture encoding apparatus 100 has inversely DCT 
process portion 124, calculating unit 128 and buffer 122. Inversely 
quantizing process portion 126 inversely quantizes a DCT coefficient 
received from quantizing process portion 108. Inversely DCT process 
portion 124 inversely performs a DCT process for a DCT coefficient 
received from inversely quantizing process portion 1 26. Calculating unit 
1 28 receives picture data from inversely DCT process portion 1 24. Buffer 
122 stores picture data. Motion compensating portion 120 motion- 
compensates picture data received from buffer 122. 

Inversely quantizing process portion 126 inversely quantizes a DCT 
coefficient received from quantizing process portion 108. Inversely 
quantizing process portion 1 26 inversely quantizes data received from 
quantizing process portion . 108 with the quantizing scale thereof and 
supplies the resultant DCT coefficient to inversely DCT process portion 
124. 

Inversely DCT process portion 124 inversely performs a DCT 
process for a DCT coefficient received from inversely quantizing process 
portion 1 26 and supplies the resultant DCT coefficient to calculating unit 
1 28. Calculating unit 1 28 receives picture data that has been processed 
in inversely DCT process portion 1 24. In addition, calculating unit 1 28 
receives picture data (that has been motion-compensated) through switch 
130. Calculating unit 128 adds the motion-compensated picture data and 
the picture data received from inversely DCT process portion 1 24 and 
supplies the resultant data to buffer 1 22. 

Buffer 1 22 receives each macro block of picture data from 
calculating unit 1 28 and stores the picture data. When motion 



compensating portion 1 20 motion-compensates picture data, predictive 
picture data are read from buffer 1 22. 

Motion compensating portion 1 20 reads each macro block of 
predictive picture data from buffer 1 22 corresponding to a motion vector 
5 MV. When picture encoding apparatus 100 generates an intra macro 

block picture, each macro block of picture data stored in frame buffer 102 
is supplied to DCT process portion 106 and quantizing process portion 
108 through calculating unit 104. DCT process portion 106 performs the 
DCT process for each macro block of the picture data. Quantizing 

10 process portion 108 quantizes the picture data received from DCT 

process portion 106. Variable length code encoding portion 1 10 encodes 
the picture data received from quantizing process portion 1 08 with 
variable length code and outputs the resultant data as a bit stream 
through buffer 1 1 2. The resultant signal that has been processed by 

15 quantizing process portion 108 and variable length code encoding portion 
1 10 is restored to picture data by inversely quantizing process portion 
126 and inversely DCT process portion 124 and temporarily stored to 
buffer 122. 

When picture encoding apparatus 1 00 generates a forward 
20 predictive MB in a P (predictive) picture, motion detecting portion 114 
detects a moving component of picture data stored in frame buffer 1 02 
so as to generate a motion vector MV. In addition, residual information 
generating portion 204 generates residual information AD. Motion vector 
MV is supplied to motion compensating portion 1 20 through controlling 
25 portion 118. Motion compensating portion 120 motion-compensates 
picture data stored in buffer 122. (When the I picture is generated, the 
picture data is stored to buffer 122). Thus, motion compensating portion 
1 20 generates predictive data. Motion compensating portion 1 20 motion- 
compensates each macro block. Switches 1 30 and 1 1 6 are closed 
30 corresponding to a switch control signal received from controlling portion 
1 18. Calculating unit 104 subtracts the predictive picture data received 
from motion compensating portion 1 20 from the picture data stored in 
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frame buffer 102. DCT process portion 106 and quantizing process 
portion 108 perform the above-described processes. Variable length code 
encoding portion 1 1 0 encodes picture data and outputs the resultant data 
as a bit stream through buffer 112. 
5 When picture encoding apparatus 100 generates a bi-directional 

predictive MB in a B (Bi-directionally predictive) picture with a backward 
reference frame and a forward reference frame, motion compensating 
portion 120 motion-compensates picture data of the preceding frame 
stored in buffer 122 and picture data of the next frame so as to generate 
10 predictive picture data. Calculating unit 104 subtracts the predictive 
picture data from the picture data stored in frame buffer 102. DCT 

q process portion 1 06 and quantizing procfess portion 1 08 perform the 

above-described processes. Variable length code encoding portion 1 10 
encodes the data received from calculating unit 1 04 with variable length 

O 1 5 code and outputs the resultant data as a bit stream through buffer 1 1 2. 

In recent years, since the process speeds of CPUs (Central 

L Processing Units) are becoming very fast and memories with large storage 

"4 capacity are becoming inexpensive, the above-described MPEG2 encoding 

iy can be performed by software. 

ff 20 However, in the MPEG2 encoding process, a process for calculating 

a motion vector is required. A motion vector is obtained by a block 
matching process. In other words, a block with the same size and the 
same origin as a block divided from the current frame to be processed is 
extracted from a reference frame. While the block of the reference frame 
25 is being moved in a predetermined search area, the sum of the absolute 
values of the difference values between pixels of the block of the 
reference frame and pixels of the relevant block of the current frame is 
obtained as a residual. A block of the reference frame with the minimum 
residual is obtained. Since the block matching process requires many 
30 calculating steps, it is difficult to perform the MPEG2 encoding process 
using software. 
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In other words, when the motion vector of a block CBLK of a 
current frame 401 shown in Fig. 2 is obtained, a search area SA is 
defined on the periphery of a block RBLK of the reference frame 402 
corresponding to the block CBLK as an origin. The block RBLK of the 
5 reference frame is extracted from the search area SA. The difference 
values between (16 x 16) pixels of the reference block RBLK and (16 x 
16) pixels of the current block CBLK are obtained. The sum of the 
absolute values of the difference values is obtained as a residual. The 
block RBLK of the reference frame 402 is moved in the predetermined 

10 search area SA. At each position of the block RBLK in the search area 
SA, the difference values between pixels of the block RBLK and pixels of 
the block CBLK of the current frame 401* are obtained. The sum of the 
absolute values of the difference values is obtained as a residual. The 
obtained sums are compared. A block with the minimum residual is 

1 5 treated as a matched block. With the matched block, a motion vector is 
obtained. 

To detect a motion vector by the block matching process, when 
each block is composed of (16 x 16) pixels, to obtain difference values of 
pixels, 16 x 16 = 256 subtracting operations are required. To obtain the 

20 sum of the absolute values of the difference values, 256 adding 
operations are required. 

When a motion vector is detected by moving a reference block in a 
predetermined search area at one pixel step, residuals should be obtained 
a number of times corresponding to the number of pixels in the search 

25 area. Thus, when residuals are obtained by moving a block in a 
predetermined search area at one pixel step and a motion vector is 
detected with the position of a block with the minimum residual, the 
number of calculating steps becomes huge. Thus, it is difficult to perform 
the MPEG2 encoding process with software. 

30 To search a motion vector at high speed, two approaches can be 

considered. As the first approach, whenever the block matching process 
is performed, the number of calculating steps is decreased. As the 
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second approach, the number of times of the block matching process in a 
search area is decreased. As an example of the first approach, while the 
sum of the absolute values of the difference values between pixels of a 
block of the reference frame and pixels of the relevant block of the 
5 current frame is being calculated, the sum is compared with a 
predetermined threshold value. When the sum is larger than the 
predetermined threshold value, the process is terminated. 

A motion vector is obtained by obtaining the minimum value of the 
sum of the absolute values of the difference values between pixels of a 

1 0 block of the reference frame and pixels of the relevant block of the 

current frame. Thus, when the sum exceeds the predetermined threshold 
value, the sum does not become the mirfimum value. Thus, it is 
meaningless to continue the process. Consequently, when the sum is 
larger than the predetermined threshold value, the process is terminated. 

15 As a result, the number of calculating steps can be decreased and a 
motion vector can be detected at high speed. 

However, in this case, it is difficult to assign such a threshold 
value. When the threshold value is too small, the process is terminated at 
all points. Thus, a motion vector cannot be correctly detected. In 

20 contrast, when the threshold value is too large, since the process is not 
terminated, the efficiency of the process cannot be improved. 

To decrease the number of calculating steps for the block matching 
process, one method is to thin out pixels of a reference frame and pixels 
of a current frame "checkerwise," i.e., according to the pattern made by 

25 squares on a checker board. In this case, the number of calculating steps 
for calculating the sum of the absolute values of the difference values can 
be halved. 

Since an MMX instruction allows a plurality of successive data 
pieces to be processed at a time, recent personal computers are equipped 
30 with CPUs that handle an MMX function. Since the block matching 

process obtains the sum of the absolute values of the difference values 
between pixels, with an MMX instruction, the block matching process can 
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be performed at high speed. However, when pixels of blocks are thinned 
out checkerwise, since the continuity of data of pixels is lost, an MMX 
instruction cannot be used. Thus, even if pixels of blocks are thinned out 
checkerwise and thereby the number of times of the block matching 
5 process is decreased, the process time cannot be remarkably shortened. 

As a conventional moving picture encoding apparatus that encodes 
picture data of a moving picture, a DCT (Discrete Cosine Transform) 
process is performed for each block of (8 x 8) pixels. 

The moving picture encoding apparatus detects a motion vector 
1 0 between adjacent frames and motion-compensates the moving picture 

with the detected motion vector so as to decrease the amount of encoded 
data. 

Conventional moving picture encoding apparatuses detect the 
motion of a picture in various methods. When the motion of a picture is 

1 5 detected, macro blocks at a relevant position of adjacent frames are 

sometimes compared. When the macro blocks are compared, the moving 
direction of the picture is unknown. Thus, the predetermined area around 
the relevant position of the adjacent frames is searched for macro blocks 
with a small difference of luminance values. 

20 When a camera that photographs a moving picture is fixed at a 

predetermined position and an object is being moved, the moving 
direction of the object varies at each position of the entire picture. Thus, 
macro blocks around a start point are searched. When the motion of the 
object is large, it may deviate from the search area. In this case, the 

25 intra-MB encoding process is performed instead of the inter-MB encoding 
process. 

In a conventional moving picture encoding apparatus, a plural 
macro block searching method has been proposed so as to decrease the 
number of calculating steps for detecting the motion of a picture. In this 
30 method, the motion of a picture is detected so that one motion vector is 
detected with a plurality of macro blocks. To obtain a motion vector of 
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each macro block, the motion vector for a plurality of macro blocks is 
used. 

However, in such a moving picture encoding apparatus, macro 
blocks are searched in a predetermined area so as to detect macro blocks 
5 whose difference is small. Thus, when an object that moves a lot is 
processed, it is necessary to widen the search area. Therefore, the 
process time necessary for detecting the motion of a picture exponentially 
increases. 

In the moving picture encoding apparatus, when the motion of an 
10 object is large, the intra-MB encoding process is performed instead of the 
inter-MB encoding process. In this case, when the motion of a picture 
exceeds a predetermined search area, thfe intra-MB encoding process is 
performed for all macro blocks. This situation takes place in the case that 
when an object is panned, the entire picture moves outside of the search 
15 area. 

In a moving picture encoding apparatus that obtains motion vectors 
of a plurality of macro blocks, when the picture moves off of the screen, 
a motion vector cannot be detected. Thus, motion vectors of a plurality 
of macro blocks cannot be detected. 
20 Fig. 3 shows an example of the structure of a conventional moving 

picture encoding apparatus that encodes picture data of a moving picture 
by an encoding process that is a DCT (Discrete Cosine Transform) 
process. 

In Fig. 3, an input MB (Macro Block) signal S51 1 is supplied to a 
25 terminal 501 . A motion vector signal MV is supplied as macro blocks MB 
one by one to a terminal 502. The input MB signal S51 1 and the motion 
vector signal MV are supplied to a motion compensating circuit 503. 

The motion compensating circuit 503 has an internal picture 
memory. A predictive picture signal (hereinafter referred to as predictive 
30 MB signal) is read as macro blocks MB one by one from the picture 
memory corresponding to the motion vector signal MV. The motion 



- 10 - 



compensating circuit 503 outputs a signal S512 that is the predictive MB 
signal obtained from the motion vector signal MV. 

A calculating device 504 adds the input MB signal S51 1 that is an 
addition signal and the signal S512 that is a subtraction signal as macro 
5 blocks MB one by one. Thus, the calculating device 504 calculates the 
difference between the input MB signal and the signal S512 and outputs 
the difference as a predictive residual MB signal S513. 

The predictive residual MB signal S513 is supplied to a DCT circuit 
505. The DCT circuit 505 performs a two-dimensional DCT process for 
10 each block of (8 x 8) pixels of the predictive residual MB signal S513 and 
outputs DCT coefficient S514, which is supplied to a quantizing circuit 
506. 

The quantizing circuit 506 receives DCT coefficient S514 from DCT 
circuit 505, a quantizing scale mQ received from terminal 507 and a 
1 5 signal from motion compensating circuit 503, and outputs a quantized 
signal. 

The quantized signal received from the quantizing circuit 506 and a 
motion vector MV corresponding thereto are supplied to a variable length 
code encoding (VLC) circuit 508. The variable length code encoding 

20 circuit 508 encodes the quantized signal and the motion vector MV with 
variable length code corresponding to MPEG syntax. 

An output signal of the variable length code encoding circuit 508 is 
supplied to a buffer memory 509. The buffer memory 509 smoothes the 
fluctuation of the number of bits of data that is generated in a short time 

25 period and received from the variable length code encoding circuit 508 
and outputs an encoded bit stream at a desired bit rate. The encoded bit 
stream that is received from the buffer memory 509 is output from a 
terminal 510. 

The quantized signal and the quantizing scale received from the 
30 quantizing circuit 506 are supplied to an inversely quantizing circuit 511. 
The inversely quantizing circuit 511 inversely quantizes the quantized 
signal corresponding to the quantizing scale. An output signal of the 
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inversely quantizing circuit 51 1 is supplied to an inversely DCT circuit 
512. The inversely DCT circuit 512 performs an inversely DCT process 
for the signal received from the inversely quantizing circuit 51 1 and 
outputs the resultant signal as a predictive residual MB signal S51 5 to a 
calculating device 513. 

The calculating device 513 also receives the predictive MB signal 
S512 that is supplied to the calculating device 504. The calculating 
device 513 adds the predictive residual MB signal S515 and the predictive 
MB signal S512 and outputs a locally decoded picture signal. This picture 
signal is the same as an output signal of the receiver side (decoder side). 

The conventional moving picture encoding apparatus performs the 
DCT process and the quantizing process'for all pictures received from the 
terminal 501. The moving picture encoding apparatus determines 
whether or not the DCT coefficient of each macro block of the picture to 
be encoded is present after performing the DCT process and the 
quantizing process for the picture data and completing all calculating 
steps for the DCT coefficient. 

However, since the moving picture encoding apparatus performs 
the calculating steps for all macro blocks even if their DCT coefficients 
finally become "0", unnecessary calculating steps should be performed. 

In addition, since the conventional moving picture encoding 
apparatus determine whether or not all DCT coefficients of macro blocks 
are "0" only after performing all calculating steps, all the calculating steps 
should be performed. 

Objects And Summary Of The Invention 

The present invention is made from the above-described point of 

view. 

A first object of the present invention is to decrease the number of 
calculating steps of the block matching process for detecting a motion 
vector and to detect it at high speed. 
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A second object of the present invention is to provide a motion 
vector calculating method and a recording medium on which a program 
thereof has been recorded, the motion vector calculating method allowing 
the number of times of the block matching process in a predetermined 
5 search area to be decreased so as to increase the process speed and an 
MMX instruction to be effectively used. 

A third object of the present invention is to provide a motion 
detecting apparatus and a motion detecting method that allow a motion 
vector of a picture that largely moves on the entire screen to be detected. 

1 0 A fourth object of the present invention is to provide a picture 

encoding apparatus and a picture encoding method that allow a time 
period of an encoding process for a picture whose DCT coefficient finally 
becomes "0" to be shortened. 

A first aspect of the present invention is a motion vector 

15 calculating method, comprising the steps of (a) extracting a block from a 
reference picture corresponding to a block of a current picture to be 
processed, the size of the block of the reference picture being the same 
as the size of the block of the current picture, the origin of the block of 
the reference picture matching the origin of the block of the current 

20 picture, (b) while moving the block of the reference picture in a 

predetermined search area, obtaining a residual between the block of the 
current picture and the block of the reference picture, (c) detecting a 
block with the minimum residual from the reference picture so as to 
calculate a motion vector, (d) orthogonally transforming pixel data of a 

25 block of the reference picture and pixel data of a block of the current 
picture, and (e) obtaining a residual between orthogonally transformed 
data of the block of the reference picture and orthogonally transformed 
data of each block of the current picture. 

A second aspect of the present invention is a recording medium on 

30 which a motion vector calculating program has been recorded, the motion 
vector calculating program causing a system that has the recording 
medium to perform the steps of (a) extracting a block from a reference 
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picture corresponding to a block of a current picture to be processed, the 
size of the block of the reference picture being the same as the size of the 
block of the current picture, the origin of the block of the reference 
picture matching the origin of the block of the current picture, (b) while 
5 moving the block of the reference picture in a predetermined search area, 
obtaining a residual between the block of the current picture and the 
block of the reference picture, (c) detecting a block with the minimum 
residual from the reference picture so as to calculate a motion vector, (d) 
orthogonally transforming pixel data of a block of the reference picture 
1 0 and pixel data of a block of the current picture, and (e) obtaining a 
residual between orthogonally transformed data of the block of the 
O reference picture and orthogonally transformed data of each block of the 
current picture, 

fg A third aspect of the present invention is a motion vector 

1 5 calculating method, comprising the steps of (a) extracting a block from a 
IH reference picture corresponding to a block of a current picture to be 

q processed, the size of the block of the reference picture being the same 

? ?J as the size of the block of the current picture, the origin of the block of 

W the reference picture matching the origin of the block of the current 

12 20 picture, (b) while moving the block of the reference picture in a 

predetermined search area, obtaining a residual between the block of the 
current picture and the block of the reference picture, (c) detecting a 
block with the minimum residual from the reference picture so as to 
calculate a motion vector, (d) while calculating a residual between pixels 
25 of a block of the reference picture and pixels of a block of the current 
picture, comparing the obtained residual with a predetermined threshold 
value, and (e) when the residual is larger than the predetermined threshold 
value, stopping the calculation of the motion vector, and (f) setting the 
initial value of the predetermined threshold value corresponding to a 
30 characteristic of a picture. 

A fourth aspect of the present invention is a recording medium on 
which a motion vector calculating program has been recorded, the motion 
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vector calculating program causing a system that has the recording 
medium to perform the steps of (a) extracting a block from a reference 
picture corresponding to a block of a current picture to be processed, the 
size of the block of the reference picture being the same as the size of the 
5 block of the current picture, the origin of the block of the reference 

picture matching the origin of the block of the current picture, (b) while 
moving the block of the reference picture in a predetermined search area, 
obtaining a residual between the block of the current picture and the 
block of the reference picture, (c) detecting a block with the minimum 

10 residual from the reference picture so as to calculate a motion vector, (d) 
while calculating a residual between pixels of a block of the reference 
picture and pixels of a block of the current picture, comparing the 
obtained residual with a predetermined threshold value, and (e) when the 
residual is larger than the predetermined threshold value, stopping the 

1 5 calculation of the motion vector, and (f) setting the initial value of the 
predetermined threshold value corresponding to a characteristic of a 
picture. 

A fifth aspect of the present invention is a motion detecting 
apparatus, comprising an extracting means for extracting a plurality of 

20 macro blocks from a picture, a first motion detecting means for detecting 
a motion vector of each of the plurality of macro blocks extracted by the 
extracting means, a motion calculating means for calculating a motion 
vector of the entire picture with motion vectors of individual macro blocks 
detected by the first motion detecting means, and a second motion 

25 detecting means for calculating a motion vector of each macro block with 
the motion vector calculated by the motion calculating means. 

A sixth aspect of the present invention is a motion detecting 
method, comprising the steps of (a) extracting a plurality of macro blocks 
from a picture, (b) detecting a motion vector of each of the plurality of 

30 macro blocks that have been extracted, (c) calculating a motion vector of 
the entire picture with motion vectors of individual macro blocks that 
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have been detected, and (d) calculating a motion vector of each macro 
block with the motion vector that have been calculated. 

A seventh aspect of the present invention is a picture encoding 
apparatus, comprising a motion detecting means for detecting a motion 
5 vector of a predetermined pixel block of input picture data and generating 
motion residual information, a determining means for comparing the 
motion residual information received from the motion detecting means 
with a predetermined value and generating a determined result, a picture 
data process means for performing a predetermined process for picture 
10 data f the predetermined process being required for an encoding process, 
an encoding means for performing the encoding process for picture data, 

p and a controlling means for skipping the predetermined process performed 

by the picture data process means corresponding to the determined result 
of the determining means and causing the encoding means to perform the 

£3 15 encoding process. 

III An eighth aspect of the present invention is a picture encoding 

:L method, comprising the steps of (a) detecting a motion vector of a 

s 4 predetermined pixel block of input picture data and generating motion 

|y residual information, (b) comparing the motion residual information with a 

rf 20 predetermined value and generating a determined result, (c) performing a 
predetermined process for picture data, the predetermined process being 
required for an encoding process, and (d) skipping the predetermined 
process corresponding to the determined result and performing the 
encoding process for the picture data. 
25 A ninth aspect of the present invention is a motion vector 

calculating method, comprising the steps of (a) extracting a block from a 
reference picture corresponding to a block of a current picture to be 
processed, the size of the block of the reference picture being the same 
as the size of the block of the current picture, the origin of the block of 
30 the reference picture matching the origin of the block of the current 
picture, (b) while moving the block of the reference picture in a 
predetermined search area, obtaining a residual between the block of the 
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current picture and the block of the reference picture, (c) detecting a 
block with the minimum residual from the reference picture so as to 
calculate a motion vector, (d) extracting N pixels of the current picture 
and N pixels of the reference picture at a time (where N is an integer), (e) 
5 storing the N pixels of the current picture and the N pixels of the 

reference picture as successive data to a memory, and (f) reading pixels 
of the block of the current picture and pixels of the block of the reference 
picture as successive data from the memory so as to obtain a residual. 
A tenth aspect of the present invention is a recording medium on 
10 which a motion vector calculating program has been recorded, the motion 
vector calculating program causing a system that has the recording 
Q medium to perform the steps of (a) extracting a block from a reference 
*% s picture corresponding to a block of a current picture to be processed, the 
;* size of the block of the reference picture being the same as the size of the 

Q 1 5 block of the current picture, the origin of the block of the reference 
[fl picture matching the origin of the block of the current picture, (b) while 

L. moving the block of the reference picture in a predetermined search area, 

obtaining a residual between the block of the current picture and the 
block of the reference picture, (c) detecting a block with the minimum 
20 residual from the reference picture so as to calculate a motion vector, (d) 
extracting N pixels of the current picture and N pixels of the reference 
picture at a time {where N is an integer), (e) storing the N pixels of the 
current picture and the N pixels of the reference picture as successive 
data to a memory, and (f ) reading pixels of the block of the current 
25 picture and pixels of the block of the reference picture as successive data 
from the memory so as to obtain a residual. 

An eleventh aspect of the present invention is a motion vector 
calculating method, comprising the steps of (a) extracting a block from a 
reference picture corresponding to a block of a current picture to be 
30 processed, the size of the block of the reference picture being the same 
as the size of the block of the current picture, the origin of the block of 
the reference picture matching the origin of the block of the current 
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picture, (b) while moving the block of the reference picture in a 
predetermined search area, obtaining a residual between the block of the 
current picture and the block of the reference picture, (c) detecting a 
block with the minimum residual from the reference picture so as to 
5 calculate a coarse motion vector, (d) while moving the block of the 

reference picture in the vicinity of the coarse motion vector obtained at 
step (c), obtaining a residual between the block of the current picture and 
the block of the reference picture, (e) detecting a block with the minimum 
residual from the reference picture so as to detect a fine motion vector, 

1 0 (f ) storing pixels of the current picture and pixels of the reference picture 
to a first memory, (g) extracting N pixels of the current picture and N 
pixels of the reference picture at a time fwhere N is an integer), and (h) 
storing the N pixels of the current picture and the N pixels of the 
reference picture as successive data to a second memory, wherein step 

1 5 (c) is performed with the N pixels of the current picture and the N pixels 
of the reference picture stored as successive data in the second memory, 
and wherein step (e) is performed with the pixels of the current picture 
and the pixels of the reference picture stored in the first memory. 

A twelfth aspect of the present invention is a recording medium on 

20 which a motion vector calculating program has been recorded, the motion 
vector calculating program causing a system that has the recording 
medium to perform the steps of (a) extracting a block from a reference 
picture corresponding to a block of a current picture to be processed, the 
size of the block of the reference picture being the same as the size of the 

25 block of the current picture, the origin of the block of the reference 

picture matching the origin of the block of the current picture, (b) while 
moving the block of the reference picture in a predetermined search area, 
obtaining a residual between the block of the current picture and the 
block of the reference picture, (c) detecting a block with the minimum 

30 residual from the reference picture so as to calculate a coarse motion 

vector, (d) while moving the block of the reference picture in the vicinity 
of the coarse motion vector obtained at step (c), obtaining a residual 
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between the block of the current picture and the block of the reference 
picture, (e) detecting a block with the minimum residual from the 
reference picture so as to detect a fine motion vector, (f) storing pixels of 
the current picture and pixels of the reference picture to a first memory, 
5 (g) extracting N pixels of the current picture and N pixels of the reference 
picture at a time (where N is an integer), and (h) storing the N pixels of 
the current picture and the N pixels of the reference picture as successive 
data to a second memory, wherein step (c) is performed with the N pixels 
of the current picture and the N pixels of the reference picture stored as 
10 successive data in the second memory, and wherein step (e) is performed 
with the pixels of the current picture and the pixels of the reference 
picture stored in the first memory. 

A thirteenth aspect of the present invention is a motion vector 
calculating method, comprising the steps of (a) extracting a block from a 
1 5 reference picture corresponding to a block of a current picture to be 

processed, the size of the block of the reference picture being the same 
as the size of the block of the current picture, the origin of the block of 
the reference picture matching the origin of the block of the current 
picture, (b) while moving the block of the reference picture in a 
20 predetermined search area, obtaining a residual between the block of the 
current picture and the block of the reference picture, (c) detecting a 
block with the minimum residual from the reference picture so as to 
calculate a motion vector, and (d) comparing contour pixels of the block 
of the reference picture with contour pixels of the block of the current 
25 picture so as to obtain a residual therebetween. 

A fourteenth aspect of the present invention is a recording medium 
on which a motion vector calculating program has been recorded, the 
motion vector calculating program causing a system that has the 
recording medium to perform the steps of (a) extracting a block from a 
30 reference picture corresponding to a block of a current picture to be 

processed, the size of the block of the reference picture being the same 
as the size of the block of the current picture, the origin of the block of 
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the reference picture matching the origin of the block of the current 
picture, (b) while moving the block of the reference picture in a 
predetermined search area, obtaining a residual between the block of the 
current picture and the block of the reference picture, (c) detecting a 
5 block with the minimum residual from the reference picture so as to 

calculate a motion vector, and (d) comparing contour pixels of the block 
of the reference picture with contour pixels of the block of the current 
picture so as to obtain a residual therebetween. 

These and other objects, features and advantages of the present 
10 invention will become more apparent in light of the following detailed 
description of a best mode embodiment thereof, as illustrated in the 
accompanying drawings. 

% ft 
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BRIEF DESCRIPTION OF THE DRAWINGS 



Fig. 1 is a block diagram showing the structure of a conventional 
MPEG2 encoder; 

5 Fig. 2 is a schematic diagram for explaining a block matching 

process; 

Fig. 3 is a schematic diagram showing the structure of a 
conventional moving picture encoding apparatus; 

Fig. 4 is a block diagram showing an example of the structure of a 
10 data processing apparatus according to the present invention; 

Fig. 5 is a flow chart for explaining an MPEG2 encoding process; 

Fig. 6 is a schematic diagram for explaining a process of a block of 
the current frame in a motion vector calculating process according to the 
present invention; 

1 5 Fig. 7 is a schematic diagram for explaining a process of a block of 

the current frame in the motion vector calculating process according to 
the present invention; 

Fig. 8 is a schematic diagram for explaining a process of a block of 
the current frame in the motion vector calculating process according to 
20 the present invention; 

Fig. 9 is a schematic diagram for explaining a zigzag scan process; 
Fig. 10 is a schematic diagram for explaining a process of a block 
of a reference frame in the motion vector calculating process according to 
the present invention; 
25 Fig. 1 1 is a schematic diagram for explaining a process of a block 

of a reference frame in the motion vector calculating process according to 
the present invention; 

Fig. 12 is a schematic diagram for explaining a process of a block 
of a reference frame in the motion vector calculating process according to 
30 the present invention; 

Fig. 13 is a graph showing a function for determining whether an 
intra-MB encoding process or an inter-MB encoding process is performed; 
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Fig. 14 is a graph showing a function for determining whether an 
intra-MB encoding process or an inter-MB encoding process is performed; 

Fig. 1 5 is a flow chart for explaining a motion vector calculating 
process according to the present invention; 

Fig. 1 6 is a flow chart for explaining a motion vector calculating 
process according to the present invention; 

Fig. 17 is a schematic diagram for explaining a checkerwise thin- 
out process; 

Figs. 18A, 18B, and 1 8C are schematic diagrams for explaining an 
arrangement of checkerwise data as successive data; 

Figs. 19A and 1 9B are schematic diagrams for explaining an 
encoding process of an MPEG2 encoder -according to the present 
invention; 

Fig. 20 a schematic diagram for explaining a memory structure 
used in an encoding process of the MPEG2 encoder according to the 
present invention; 

Fig. 21 is a timing chart for explaining an encoding process of the 
MPEG2 encoder according to the present invention; 

Fig. 22 is a flow chart for explaining a motion vector calculating 
process of the MPEG2 encoder according to the present invention; 

Fig. 23 is a flow chart for explaining a motion vector calculating 
process of the MPEG2 encoder according to the present invention; 

Fig. 24 is a flow chart for explaining a motion vector calculating 
process of the MPEG2 encoder according to the present invention; 

Fig. 25 is a schematic diagram for explaining an embodiment of the 
present invention; 

Fig. 26 is a flow chart for explaining an embodiment of the present 

invention; 

Fig. 27 is a block diagram showing an example of the structure of a 
picture encoding apparatus according to the present invention; 
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Fig. 28 is a schematic diagram for explaining a macro block 
extracting process used in a global vector detecting portion of the picture 
encoding apparatus; 

Fig. 29 is a schematic diagram for explaining a process for dividing 
one picture into a plurality of areas and obtaining global vectors, the 
process being performed by the global vector detecting portion of the 
picture encoding apparatus; 

Fig. 30 is a flow chart for explaining a motion detecting process of 
the picture encoding apparatus; 

Fig. 31 is a block diagram showing the structure of a picture 
encoding apparatus according to the present invention; and 

Fig. 32 is a flow chart for explaining an encoding process of the 
picture encoding apparatus according to the present invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



Next, with reference to the accompanying drawings, embodiments 
of the present invention will be described. Fig. 4 is a block diagram 
5 showing an example of the structure of a data processing apparatus 
according to a first embodiment of the present invention. 

Referring to Fig. 4, reference numeral 1 is a CPU (Central 
Processing Unit). Reference numeral 2 is a ROM (Read Only Memory). 
Reference numeral 3 is a RAM (Random Access Memory). The CPU 1, 
10 the ROM 2, and the RAM 3 are connected to a processor bus 4. 

The CPU 1 is, for example, a processor having an MMX function. 
The MMX function allows a moving picture reproducing process, a picture 
editing process, a sound synthesizing process and so forth to be 
performed at high speed. With an MMX instruction that employs SIMD 
15 (Single Instruction Multiple Data) technology, the same process can be 
performed for successive data at one time. 

The ROM 2 stores a boot strap program. The RAM 3 is a main 
memory, used as a working area. The recommended storage capacity of 
the RAM 3 is, for example, 64 MB or more. 
20 The CPU 1 is connected to a bridge circuit 5. The bridge circuit 5 

is connected to the processor bus 4. The bridge circuit 5 is connected to 
a PCI (Peripheral Component Interconnect) bus 6. The bridge circuit 5 
connects the CPU 1 , the processor bus 4, and the PCI bus 6. 
The PCI bus 6 is connected to an IDE (Integrated Device 
25 Electronics) controller 7, a SCSI (Small Computer System Interface) 

controller 8, a graphics accelerator 9, and an IEEE (Institute Of Electrical 
and Electronics Engineers) 1394 controller 10. 

The IDE controller 7 is connected to a storage device 1 1 such as a 
hard disk drive or a CD drive. The SCSI controller 8 is connected to a 
30 storage device 12 such as a hard disk drive or a CD drive. The SCSI 
controller 8 is also connected to a peripheral unit, such as an image 
scanner, as well as a storage device. The graphics accelerator 9 is 
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connected to a display 13. The IEEE 1394 controller 10 is connected to a 
digital audio video unit such as a digital VCR (Video Cassette Recorder). 

The PCI bus 6 is connected to an ISA (Industrial Standard 
Architecture) bus 15 through a bridge circuit 14. The bridge circuit 14 
5 connects the PCI bus 6 and the ISA bus 1 5. The ISA bus 1 5 is 

connected to an input device controller 16, a floppy disk controller 17, a 
parallel controller 18, and an RS232C controller 19. 

The input device controller 1 8 is connected to an input device 20 
such as a keyboard or a mouse. The floppy disk controller 1 7 is 
10 connected to a floppy disk drive 21 . A printer or the like can be 
connected to the parallel controller 18. Modem or the like can be 
connected to the RS232C controller 19.* 

In the initial state, the boot strap program stored in the ROM 2 gets 
started so as to establish initial settings. Thereafter, storage device 1 1 or 
15 1 2 is accessed. An operating system stored in the storage device 1 1 or 
12 is read. The operating system resides in the RAM 3 as a main 
memory. Thus, the operating system gets started. Under the control of 
the operating system, various processes are executed. 

In the example, the PCI bus and the ISA bus are used. However, 
20 according to the present invention, USB (Universal Serial Bus) can be 
used. A keyboard, a mouse, or the like can be connected to the USB. 

When the data processing apparatus shown in Fig. 4 performs the 
MPEG2 encoding process, an application program for performing the 
MPEG2 encoding process is executed. The application program is stored 
25 as an executable program in the storage device 1 1 such as an IDE hard 
disk or the storage device 1 2 such as a SCSI hard disk. When the 
application program is executed, it is read to the RAM 3 and sequentially 
executed by the CPU 1 . 

The application program for performing the MPEG2 encoding 
30 process may be pre-installed to the storage device 1 1 such as an IDE hard 
disk or the storage device 1 2 such as a SCSI hard disk. Alternatively, a 
CD-ROM or a floppy disk may be provided with the application program 
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for performing the MPEG2 encoding process in an executable format or a 
compressed format. The user may install the program stored in the CD- 
ROM or the floppy disk to the storage device 1 1 such as an IDE hard disk 
or the storage device 1 2 such as a SCSI hard disk. As another alternative 
5 method, the application program may be downloaded through a 
communication line. 

When the application program for performing the MPEG2 encoding 
process is executed, a motion vector calculating process, a DCT 
calculating process, a quantizing process, and a variable length code 

1 0 encoding process are performed for digital video data corresponding to a 
prediction mode. The digital video data are compressed corresponding to 
the MPEG2 method. At this point, as a Vvorking area, the RAM 3 is used. 
Calculating operations for such processes are performed by calculating 
functions of the CPU 1 . The digital video data are input from an external 

1 5 digital VCR or the like connected to the IEEE 1 394 controller 10. Output 
data are recorded to a hard disk drive or the like connected to the SCSI 
controller 8 or the IDE controller 7. 

Fig. 5 is a flow chart showing the MPEG2 encoding process of the 
program. 

20 As shown in Fig. 5, digital video data of a plurality of frames are 

input. The digital video data are buffered to the RAM 3 (at step S1 ). By 
a block matching process, a motion vector is calculated (at step S2). In 
the block matching process, contour pixels of blocks may be used. 

In step S3, it is determined whether or not the prediction mode is 

25 an I picture, a forward-predictive P picture, or a bi-directionally predictive 
B picture. When the prediction mode is an I picture as the determined 
result at step S3, the DCT process for each block of (8 x 8) pixels of the 
same frame is performed (at step S4). The obtained coefficient data are 
quantized (at step S5) and then encoded with variable length code (at 

30 step S6). The resultant data is stored as data of a reference picture to 
the RAM 3 (at step S7). 
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When the prediction mode is a P picture as the determined result at 
step S3, data of a forward reference picture is read from the RAM 3 (at 
step S8). At step S9, the reference picture is motion-compensated 
corresponding to the motion vector calculated at step S2. Thus, the 
5 difference is obtained between the data of the current picture and the 
data of the reference picture that has been motion-compensated. The 
DCT process is performed for the difference between the data of the 
current picture and the data of the reference picture (at step S10). The 
obtained data are quantized (at step S1 1) and then encoded with variable 

10 length code (at step S12). The resultant data are stored as data of the 
reference picture to the RAM 3 (at step S13). 

When the prediction mode is a B picture as the determined result at 
step S3, data of bidirectional reference pictures are read from the RAM 3 
(at step S14). The reference picture is motion-compensated 

15 corresponding to the motion vector calculated at step S2 (at step S15). 

The difference is obtained between the data of the current picture and the 
data of the reference pictures that have been motion-compensated. The 
DCT process is performed for the difference between the data of the 
current picture and the data of the reference pictures (at step S16). The 

20 obtained data are quantized (at step S17) and encoded with variable 
length code (at step S18). 

The motion vector calculated at step S2 shown in Fig. 5 is 
performed in the following manner. A block with the same size and the 
same origin as a block divided from the current frame to be processed is 

25 extracted from a reference frame. While the block of the reference frame 
is being moved in a predetermined search area, the sum of absolute 
values of difference values between pixels of the block of the reference 
frame and pixels of the relevant block of the current frame is obtained as 
a residual. A block of the reference frame with the minimum residual is 

30 obtained. The block matching process requires many calculating steps. 
Thus, according to the present invention, the block matching 
process is performed by orthogonally transforming data of blocks and 
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comparing the blocks. As an example of the orthogonally transforming 
process, a Hadamard transforming method may -be used. 

In other words, as shown in Fig. 6, data pieces CD1, CD2, and 
CD256 of a block CBLK of (1 6 x 16) pixels of the current frame are 
5 obtained. As shown in Fig. 7, the block CBLK of (16 x 16) pixels of the 
current frame is divided into four blocks TBLK-C1 to TBLK-C4, each of 
which is composed of (8 x 8) pixels. As shown in Fig. 8, the four blocks 
TBLK-C1 to TBLK-C4 are orthogonally transformed into spectrum data 
pieces TCD1-1 to TCD1-64, TCD2-1 to TCD2-64, TCD3-1 to TCD3- 64, 

10 and TCD4-1 to TCD4-64. Data pieces of the four blocks TBLK-C1 to 
TBLK-C4 are obtained, e.g., in the order of lower spatial frequency data 
pieces by zigzag scanning method as shown in Fig. 9. 

Likewise, as shown in Fig. 10, data pieces RD1, RD2, and 
RD256 of a block RBLK of (16 x 16) pixels of a reference frame are 

1 5 obtained. As shown in Fig. 1 1 , the block RBLK of the reference frame is 
divided into four blocks TBLK-R1 to TBLK-R4. As shown in Fig. 12, the 
four blocks TBLK-R1 to TBLK-R4 are orthogonally transformed into 
spectrum data pieces TRD1-1 to TRD1-64, TRD2-1 to TRD2-64, TRD3-1 
to TRD3-64, and TRD4-1 to TRD4-64. Data pieces of the four blocks 

20 TBLK-R1 to TBLK-R4 are obtained, e.g., in the order of lower spatial 
frequency data pieces by zigzag scanning method as shown in Fig. 9. 

When a video signal is orthogonally transformed, energy 
concentrates on low frequency data. Thus, high frequency data almost 
does not exist. Consequently, when data pieces of the four blocks 

25 TBLK-C1 to TBLK-C4 are obtained by the zigzag scanning method, the 
number of data pieces is limited to a predetermined value. In this 
example, the number of data pieces obtained is limited to 1 0, However, 
according to the present invention, the number of data pieces obtained 
may be a value other than 10. Likewise, when data pieces of the four 

30 blocks TBLK-R1 to TBLK-R4 of the reference frame are obtained by the 
zigzag scanning method, the number of data pieces obtained is limited to 
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a predetermined value. In this example, the number of data pieces 
obtained is limited to 1 0. 

In other words, for example, 10 data pieces (denoted by black dots 
shown in Fig. 8) are obtained from the four blocks TBLK-C1 to TBLK-4 of 
5 the current frame. Likewise, for example, 10 data pieces (denoted by 
black dots shown in Fig. 12) are obtained from the four blocks TBLK-R1 
to TBLK-R4 of the reference frame. The sum of the absolute values of 
the difference values between the data pieces obtained from the four 
blocks TBLK-C1 to TBLK-C4 of the current frame and the data pieces 
10 obtained from the fourth blocks TBLK-R1 to TBLK-R4 of the reference 
frame is obtained as a residual. 

In the block matching process, sirfce data of one block is 
\U orthogonally transformed and the number of data pieces is limited to a 

il predetermined value, the number of calculating steps can be remarkably 

J2 15 decreased. Thus, the calculating speed is improved. 
E[J In other words, as described above, one block is divided into four 

s blocks. The four blocks are orthogonally transformed (by for example 

!y Hadamard transforming method). The number of data pieces of each 

orthogonally transformed block is limited to 10. In this condition, the 
C3 20 block matching process is performed. In this case, since the number of 
data pieces is limited to 10 and one block is divided into four blocks, the 
number of calculating steps for obtaining the residual in the block 
matching process is 40. In contrast, when the block matching process is 
performed with a block composed of (16 x 16) pixels, the number of 
25 calculating steps becomes (16x 16 = 256). Thus, when a residual is 
obtained with one block that is orthogonally transformed, the number of 
calculating steps can be remarkably decreased. 

In this case, the orthogonally transforming method such as 
Hadamard transforming method should be used. However, the Hadamard 
30 transforming method can be performed with simple arithmetic operations 
such as additions and subtractions. Thus, the number of calculating 
steps does not largely increase. 
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In the MPEG2 encoding process, a picture of the current frame is 
used as a picture of the next reference frame. Thus, when orthogonally 
transformed data of a block of a picture of the current frame is stored, it 
can be used as data of a reference frame. 
5 When a motion vector is searched, search areas overlap. In an 

overlapped area, the same orthogonally transformed data are required. 
Thus, for a block of a reference frame, orthogonally transformed data that 
have been moved pixel by pixel are stored. In this case, when search 
areas overlap, the stored data can be used. 

10 In the above-described example, as the orthogonally transforming 

method, the Hadamard transforming method was used. However, 
according to the present invention, for example, a DCT transforming 
method or an FFT (Fast Fourier Transform) can be used. 

In the above-described example, a block of (16 x 16) pixels is 

1 5 divided into four blocks, each of which is composed of (8 x 8) pixels. 
The four divided blocks are orthogonally transformed. Alternatively, a 
block of (16 x 16) pixels may be directly orthogonally transformed. 
However, when one block is divided into four blocks and the four blocks 
are orthogonally transformed, the transforming algorithm becomes simple. 

20 In addition, a general-purpose orthogonally transforming circuit and 
algorithm can be used. 

Next, a second embodiment of the present invention will be 
described. According to the second embodiment, in the middle of the 
block matching calculating loop, the sum of the absolute values of the 

25 difference values between pixels of a block of the current frame and 
pixels of the relevant block of a reference frame is compared with a 
predetermined threshold value. When the sum is equal to or larger than 
the predetermined threshold value, the process is terminated. Thus, the 
number of calculating steps can be decreased. Consequently, a motion 

30 vector can be detected at high speed. 

The threshold value is assigned corresponding to the sum of mean 
discrete absolute values (MAD) and a residual AD (0, 0) at the origin. 
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Thus, since the threshold value is dynamically assigned, the process can 
be effectively performed. 

In the case of a P picture and a B picture, the intra-MB encoding 
process may be performed for each macro block (in Fig. 5, for simplicity, 
5 in the case of a P picture and a B picture, the inter-MB encoding process 
is performed with a reference frame). In other words, the inter-MB 
encoding process can compress a picture more effectively than the 
intra-MB encoding process. However, in the case of a picture that 
contains many DC components or a picture that moves a lot (namely, the 

10 sum of the absolute values of the difference values between pixels of a 
block of the current frame and pixels of the relevant block of a reference 
frame is large), the intra-MB encoding prt>cess can compress the picture 
more effectively than the inter-MB encoding process. When the intra-MB 
encoding process is performed, since the motion vector calculating 

1 5 process is not required, inaccuracy of the motion vector is permissible. 

When the sum of the absolute values of the difference values 
between pixels of a block of the current block and pixels of the relevant 
block of a reference block is equal to or larger than the predetermined 
threshold value, the process is terminated. Thus, if a large value is 

20 assigned to the threshold value, the probability of the process terminating 
in the middle becomes high. In contrast, when a small value is assigned 
to the threshold value, the probability of a motion vector being 
inaccurately detected becomes high. However, when the intra-MB 
encoding process is performed, since the motion vector calculating 

25 process is not required, inaccuracy of the motion vector is permissible. 
Thus, when a residual obtained in the intra-MB encoding process is used 
as the threshold value, the efficiency of the process is improved. 

When a P picture and a B picture are encoded, the intra-MB 
encoding process is performed corresponding to the value of the MAD 

30 and the residual AD (x, y) of the detected motion vector. 

The MAD is the sum of the absolute values of the difference values 
between the values of pixels of one frame and the mean value thereof. 
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The MAD represents the complexity of a pattern of one block of a picture. 
Thus, when a pattern is simpte, the value of the MAD is small. In 
contrast, when a pattern is complicated, the value of the MAD is large. 
Thus, with a function shown in Fig. 13, it is determined whether 
5 the intra-MB encoding process or the inter-MB encoding process is 
performed. In Fig. 13, the horizontal axis represents the value of the 
residual AD (x, y) at the position of a motion vector, whereas the vertical 
axis represents the value of the MAD. In Fig. 13, when both the value of 
the MAD and the value of the residual (x, y) at the position of a motion 

10 vector are in an area AR1, the intra-MB encoding process is performed. 
When they are in an area AR2, the inter-MB encoding process is 
performed. When the value of the MAD'is small, since the pattern of the 
current block is simple, this function represents that the intra-MB 
encoding process is performed. When the value of the residual AD (x, y) 

1 5 at the position of the motion vector is small, this function represents that 
the inter-MB encoding process is performed instead of the intra-MB 
encoding process. 

During the block matching process, the sum of the absolute values 
of the difference values between pixels of a block of the current frame 

20 and pixels of the relevant block of a reference frame is compared with a 
predetermined threshold value. When the sum is equal to or larger than 
the threshold value, the process is terminated. In addition, corresponding 
to functions shown in Figs. 1 3 and 1 4 with the value of the MAD and the 
value of the residual AD (0, 0) at the origin, it is determined whether the 

25 intra-MB encoding process or the inter-MB encoding process is performed. 
In addition, it is determined whether or not the motion compensation is 
performed. Based on the determined results, the initial threshold value is 
assigned. Thus, a motion vector can be effectively calculated. Fig. 1 5 
shows a flow chart showing such a process. 

30 Since the threshold value that is initially assigned in the block 

matching process is not always small, when a motion vector is detected 
at first, the threshold value is obtained with the value of the MAD and the 
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value of the residual AD (0, 0) at the origin. In the next block matching 
process for the same block, the original threshold value or the obtained 
sum, whichever is smaller, is used. Thus, the process can be effectively 
performed. 

5 In Fig. 1 5, the search area of a block of a reference frame is initially 

set (at step S21). Thereafter, the values of the MAD is obtained (at step 

522) . The value of the residual AD (0, 0) at the origin is obtained (at step 

523) . Corresponding to the value of the MAD and the value of the 
residual AD (0, 0) at the origin, the initial value of the ADmin is set (at 

10 step S24). 

The value of the ADmin represents the minimum value of the 
residual that has been obtained. The initial value of the ADmin becomes 
the initial threshold value of the sum of the absolute values of the 
difference values between pixels of a block of the current frame and 

1 5 pixels of the relevant block of a reference frame. The value of the ADmin 
is dynamically set depending on whether the MAD or AD (0,0) is smaller. 

After the initial value of the ADmin has been set at step S24, the 
upper left position of the search area is selected for the first block 
matching process (at step S25). For a block at the initial position, the 

20 block matching process is performed (at step S26). 

During the block matching process, the sum of the absolute values 
of the difference values between pixels of a block of the current frame 
and pixels of the relevant block of a reference frame is compared with a 
predetermined threshold value. When the sum is equal to or larger than 

25 the predetermined threshold value, the process is terminated. When the 
block matching process is initially performed, as the threshold value, the 
initial value of the ADmin obtained at step S24 is used. Fig. 1 6 is a flow 
chart showing the block matching process. 

Referring to Fig. 16, in the block matching process (at step S26), a 

30 pixel position is initially set (at step S41 ). The value of the residual AD is 
initially set (at step S42). A residual is obtained with the sum of the 
absolute values of the difference values between pixels of a reference 
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frame and pixels of a current Irame (at step S43). In'the middle of the 
block matching process, it is determined whether or not the sum of the 
absolute values of the difference values between pixels of the reference 
frame and pixels of the current frame exceeds the value of the ADmin (at 
5 step S44). When the determined result at step S44 is Yes (namely, the 
sum exceeds the value of the ADmin), the block matching process is 
terminated. The flow returns to the main routine. When the determined 
result at step S44 is No (namely, the sum does not exceed the value of 
the ADmin), it is determined whether or not the block matching process 

10 has been performed for all the pixels (at step S45). When the determined 
result at step S45 is No (namely, the block matching process has not 
been performed for all the pixels), the flcfw returns to step S43. At step 
S43 f the sum of the absolute values of the difference values between 
pixels of the reference frame and pixels of the current frame is continued. 

1 5 When the determined result at step S45 is Yes (namely, the block 
matching process has been performed for all the pixels), the block 
matching process is terminated. Thereafter, the flow returns to the main 
routine. 

As described above, in the block matching process, it is determined 
20 whether or not the value of the AD exceeds the value of the ADmin at 
step S44. When the determined result at step S44 is Yes (namely, the 
value of the AD exceeds the value of the ADmin), the flow returns to the 
main routine. Thus, the value of the ADmin becomes the threshold value. 
During the block matching process, the sum of the absolute values of the 
25 difference values between pixels of a block of the current frame and 

pixels of the relevant block of the reference frame is compared with the 
threshold value. When the sum exceeds the predetermined threshold 
value, the block matching process is terminated. Thus, the number of 
calculating steps is decreased. Consequently, a motion vector can be 
30 detected at high speed. 

In addition, the value of the ADmin used as the initial value of the 
threshold value is set corresponding to the value of the MAD and the 
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value of the residual AD (0, 0) at the origin. When a residual represents 
that the intra-frame encoding process is performed, since a motion vector 
is not required, inaccuracy of the motion vector is permissible. Since the 
threshold value is dynamically varied, when a residual exceeds a value at 
5 which a motion vector is not required, the probability of which the block 
matching process is terminated becomes high. Thus, the number of 
calculating steps is further decreased. 

In Fig. 1 5, the value of the AD obtained in the block matching 
process is compared with the minimum value ADmin that has been 
10 obtained (at step S27). When the determined result at step S27 is Yes 
(namely, the value of the AD is smaller than the minimum value ADmin), 
C3 the current sum AD is used as the minimum value ADmin (at step S28). 

The value of the AD is recorded at step S29. Thereafter, the next block 
lZ * s processed (at step S30). Thereafter, it is determined whether or not 

S3 1 5 the last block has been processed (at step S31). When the determined 
Iff result at step S31 is No (namely, the last block has not been processed), 

f. n the flow returns to step S26. At step S26, the block matching process is 

y performed for the next block. 

y As shown in Fig. 16, during the block matching process, the sum 

12 20 of the absolute values of the difference values between pixels of a block 
of the current frame and pixels of the relevant block of the reference 
frame is compared with a predetermined threshold value. When the sum 
is equal to or larger than the predetermined threshold value, the process is 
terminated. As the threshold value, the value of the ADmin is used. 
25 At step S27, the value of the AD that has been obtained in the 

block matching process is compared with the value of the ADmin that has 
been obtained. When the determined result at step S27 is Yes (namely, 
the value of the AD is smaller than the current minimum value ADmin), 
the value of the AD becomes the value of the ADmin. Thus, when the 
30 value of the AD that has been obtained is larger than the value of the 
ADmin, the next threshold value is the same as the original threshold 
value. When the value of the AD is smaller than the value of the ADmin, 
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the next threshold value becomes the minimum value of the AD. Thus, in 
the block matching process shown in Fig. 16, when a residual exceeds 
the value of the ADmin, the process is terminated. 

Thereafter, a loop from step S26 to step S31 is repeated. The 
5 minimum value of the sum of the absolute values of the difference values 
between pixels of a block of the current frame and pixels of the relevant 
block of the reference frame is obtained. At step S31, it is determined 
whether or not the last block has been processed. When the determined 
result at step S31 is Yes (namely, the last block has been processed), the 
10 process is terminated. The result is stored (at step S32). 

In the above-described example, the initial threshold value is 
assigned corresponding to the value of the MAD and the value of the 
residual AD (x, y) at the origin. However, according to the present 
invention, the threshold value may be assigned corresponding to one of 
15 the value of the MAD and the value of the residual AD (x, y) at the origin. 

In a third embodiment of the present invention, as shown in Fig. 
17, the block matching process is performed by checkerwise thinning out 
pixels of a block of a reference frame and pixels of a relevant block of the 
current frame. 

20 Referring to Fig. 17, a block 31 of a reference frame is composed 

of (1 6 x 16) pixels. (8x16) pixels are obtained checkerwise from the 
block 31 . Likewise, a block 32 of the current frame is composed of (16 x 
16) pixels. (8x16) pixels are obtained checkerwise from the block 31 . 
At this point, the pixels of the current frame and the pixels of the 

25 reference frame that have been obtained checkerwise are stored as 

successive data to a memory (a predetermined area of the RAM 3) so that 
the block matching process can be performed effectively with an MMX 
instruction. 

In other words, as shown in Fig. 1 8A, pixels of the current frame 
30 and pixels of the reference frame are obtained checkerwise. As shown in 
Fig. 18B, the pixels that have been thinned out checkerwise are 
rearranged as successive data. As shown in Fig. 18C, the pixels that 
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have been thinned out checkerwise are stored to successive addresses of 
the memory. 

When the pixels of the current frame and the pixels of the 
reference frame are stored as successive data to the memory, since the 
5 block matching process can be performed with an MMX instruction, the 
process can be performed at high speed. 

When the pixels of the current frame and the pixels of the 
reference frame that have been thinned out checkerwise are stored as 
successive data to the memory, since two pixels are searched at a time, a 
10 logarithmic searching process can be easily performed as well as the 
availability of an MMX instruction. 

In the logarithmic searching process, a point with the minimum 
residual is coarsely searched in a search area. Thereafter, a point with 
the minimum residual is finely searched around the coarsely searched 
1 5 point. As a result, a motion vector is detected. 

When pixels of the current frame and pixels of the reference frame 
that have been obtained checkerwise are stored as successive data to the 
memory, the logarithmic searching process is performed in the following 
manner. 

20 A first memory that stores all pixels of the current frame and all 

pixels of the reference frame is prepared. A second memory (or a 
memory area) that stores pixels of the current frame and pixels of the 
reference frame that have been obtained checkerwise as successive data 
is also prepared. Using the second memory, a motion vector is searched 

25 for coarsely using two-pixel steps. After a motion vector has been 
coarsely detected, using the first memory, a motion vector is finely 
searched for in the vicinity of the obtained point pixel by pixel. Thus, a 
motion vector can be finally detected. 

For example, as shown in Fig. 19A, picture data pieces F1, F2, F3, 

30 F4, F5, F6, F7, and so forth of each frame are input. The input picture 
data pieces F1, F2, F3, F4, F5, F6, F7, and so forth are encoded into 
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MPEG2 picture data pieces P1 , P2, P3, P4, P5, P6, P7, and so forth in 
the order of I, B, B, P, B, B, and P pictures. 

In such an encoding process, as shown in Fig. 20, to obtain a 
motion vector, the working area RAM 3 has memory areas 21 A to 21 F 
5 and memory areas 22A to 22C. The memory areas 21 A to 21 F store all 
pixels of one frame. The memory areas 22A to 22C store pixels of one 
frame that have been obtained checkerwise as successive data. 

When the picture data pieces F1, F2, F3, and so forth are input as 
shown in Fig. 21 , the picture data pieces of each frame are stored to the 
10 memory areas 21 A to 21 F. In addition, pixels are obtained checkerwise 
from the picture data pieces sample by sample. The resultant pixels are 
arranged to successive addresses and stored as picture data pieces f1, f2, 
f3, and so forth to the memory areas 22A to 22C. 

In other words, at time point T1 , the picture data piece F1 is stored 
1 5 in the memory area 21 A. At time point T2, the picture data piece F2 is 
stored in the memory area 21 B. At time point T3, the picture data piece 
F3 is stored in the memory area 21 C. At time point T4, the picture data 
piece F4 is stored in the memory area 21 D. 

At time point T4, pixels are obtained from the picture data piece F1 
20 checkerwise, sample by sample. The picture data piece f 1 , arranged to 
successive addresses, is stored in the memory area 22A. Pixels are 
obtained from the picture data piece F4 checkerwise, sample by sample. 
The picture data piece f4, arranged in successive addresses, is stored in 
the memory area 22B. 
25 At time point T5, the picture data piece F5 is stored in the memory 

area 21 E. Pixels are obtained from the picture data piece F2 
checkerwise, sample by sample. The picture data piece f2, arranged in 
successive addresses, is stored in the memory area 22C. 

At time point T6, the picture data piece F6 is stored in the memory 
30 area 21 F. Pixels are obtained from the picture data piece F3 checkerwise, 
sample by sample. The picture data piece f3, arranged in successive 
addresses, is stored in the memory area 22C. 



- 38 - 



At time point T7, the picture data piece F7 is stored in the memory 
area 21 A. Pixels are obtained from the picture data piece F7 
checkerwise, sample by sample. The picture data piece f7, arranged in 
successive addresses, is stored in the memory area 22A. 
5 As shown in Fig. 21 , picture data pieces of each frame are stored 

in the memory areas 21 A to 21 F. In addition, pixels are obtained from 
picture data pieces checkerwise, sample by sample. Picture data pieces, 
arranged in successive addresses, are stored in the memory areas 22A to 
22C. 

1 0 With the picture data pieces F1 , F2, F3, and so forth stored in the 

memory areas 21 A to 21 F and the picture data pieces f 1 , f2, f3, and so 
forth stored in the memory areas 22A tcf 22C, a motion vector is 
obtained. A motion vector is searched in a predetermined search area 
using two-pixel steps. A motion vector is searched in the vicinity of the 

1 5 searched point pixel by pixel. In other words, a motion vector is obtained 
by the logarithmic searching process. 

Since the picture data piece P1 is an I picture, it can be encoded 
from time point T1 to time point T3. 

At time point T4, the picture data piece P4 that is a P picture is 

20 encoded. A motion vector of the picture data piece P4 is obtained. For 
the picture data piece P4, the picture data piece F1 is used as a reference 
frame and the picture data piece F4 is used as the current frame. In this 
case, in the course searching process (using two-pixel steps), as a block 
of a reference frame, the picture data piece f 1 stored in the memory area 

25 22A is used. As a block of the current frame, the picture data piece f4 
stored in the memory area 22B is used. In the fine searching process 
(using one-pixel steps), as a block of the reference block, the picture data 
piece F1 stored in the memory area 21 A is used. As a block of the 
current frame, the picture data piece F4 stored in the memory area 21 D is 

30 used. 

At time point T5, the picture data piece P2 that is a B picture is 
encoded. A motion vector of the picture data piece P2 is obtained. For 
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the picture data piece. P2, as reference frames, the picture data pieces F1 
and F4 are used. As the current frame, the picture data piece F2 is used. 
In this case, in the coarse searching process (using two-pixel steps), as 
blocks of the reference frames, the picture data piece f 1 stored in the 
5 memory area 22A and the picture data piece f4 stored in the memory 
area 22B are used. As a block of the current frame, the picture data 
piece f2 stored in the memory area 22C is used. In the fine searching 
process {using one-pixel steps), as blocks of the reference frames, the 
picture data piece F1 stored in the memory area 21 A and the picture data 
10 piece F4 stored in the memory area 21 D are used. As a block of the 

current frame, the picture data piece F2 stored in the memory area 21 B is 
used. 

At time point T6, the picture data piece P3 that is a B picture is 
encoded. A motion vector of the picture P3 is obtained. For the picture 

15 data piece P3, as reference frames, the picture data pieces F1 and F4 are 
used. As the current frame, the picture data piece F3 is used. In this 
case, in the coarse searching process (using two-pixel steps), as blocks of 
the reference frames, the picture data piece f 1 stored in the memory area 
22A and the picture data piece f4 stored in the memory area 22B are 

20 used. As a block of the current frame, the picture data piece f3 stored in 
the memory area 22C is used. In the fine searching process (using one- 
pixel steps), as blocks of the reference frames, the picture data piece F1 
stored in the memory area 21 A and the picture data piece F4 stored in 
the memory area 21 D are used. As a block of the current block, the 

25 picture data piece F3 stored in the memory area 21 C is used. 

At time point T7, the picture data piece P7 that is a P picture is 
encoded. A motion vector of the picture P7 is obtained. For the picture 
data piece P7, as a reference frame, the picture data piece F4 is used. As 
the current frame, the picture data piece F7 is used. In this case, in the 

30 coarse searching process (using two-pixel steps), as a block of the 

reference frame, the picture data piece f4 stored in the memory area 22B 
is used. As a block of the current frame, the picture data piece f7 stored 
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in t he memory area 22A is used, in the fine searching process (using 
on e-pixeI steps), as a block of the reference frame, the picture data piece 
f4 stored in the memory area 21 D is used. As a block of the current 
frame, the picture data piece F7 stored in the memory area 21 A is used. 

Similarly, at time point T8, a motion vector of the picture data 
piece P5 that is a B picture is obtained. At time point T9, a motion vector 
of the picture data piece P6 that is a B picture is obtained. 

Fig. 22 is a flow chart showing a logarithmic searching process for 
calculating a motion vector. In Fig. 22, input picture data are stored. 
Pixels are extracted checkerwise from a reference frame and the current 
frame sample by sample. The pixels that have been thinned out 
checkerwise are arranged as successive 'data and stored (at step S121). 

Thereafter, it is determined whether or not all blocks of the picture 
have been processed (at step S122). 

When the determined result at step S1 22 is No (namely, all the 
blocks of the picture have not been processed), while the block is being 
moved in a predetermined search area using two-pixel steps, a motion 
vector is searched for (at step S123). 

After a motion vector has been detected, while the block is being 
moved in the vicinity of the detected motion vector pixel by pixel, a 
motion vector is searched (at step S124). 

The detected result is stored (at step S125). Thereafter, the next 
block is processed (at step S126). Thereafter, the flow returns to step 
S122. When the determined result at step S122 is No (namely, all the 
blocks have not been processed), the similar process is repeated. Thus, 
the motion vector of the next block is obtained. After the motion vector 
of the last block of picture has been obtained, since the determined result 
at step S1 22 is Yes, the process is completed. 

Fig. 23 is a flow chart showing the coarse searching process (using 
two-pixel steps) at step S123 shown in Fig. 22. In the coarse searching 
process, pixels of the current frame and pixels of the reference frame are 
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extracted checkerwise. The extracted pixels are stored as successive 
data to a memory. 

In Fig. 23, the start point of the search area is set (at step S131). 
The vertical search start position is reset to the upper end (at step S1 32). 
5 In the vertical direction, it is determined whether or not the lower end has 
been detected (at step S133). When the determined result at step S133 
is No (namely, the lower end has not been detected), the horizontal 
position is reset to the left end (at step S134). 

Thereafter, it is determined whether or not the right end of the 
10 search area has been detected (at step S135). When the determined 
result at step S1 35 is No (namely, the right end of the search area has 
not been detected), the block matching process is performed for a 
%| checkerwise block of (8x16) pixels and a residual is obtained (at step 

P S136). 

pi 15 At this point, it is determined whether or not the value of the 

residual AD is smaller than the minimum value ADmin that has been 
s obtained (at step S137). When the determined result at step S137 is Yes 

£3 

T| (namely, the value of the residual AD is smaller than the minimum value 

ADmin), the value of the residual AD is the minimum value ADmin. In 

C3 20 addition, the motion vector MV is the current position (at step S138). 

Thereafter, the horizontal position is moved for two pixels (at step S139). 
The pixels of the current frame and the pixels of the reference frame are 
extracted checkerwise and stored as successive data to the memory. 
Thus, when the horizontal position is moved for two pixels, the address is 
25 moved for one position in the memory. 

When the determined result at step S137 is No (namely, the value 
of the residual AD is not smaller than the minimum value ADmin), the 
flow advances to step S1 39. At step S1 39, the horizontal position is 
moved for two pixels. Thereafter, the flow returns to step S135. 
30 At step S135, it is determined whether or not the right end of the 

search area has been detected. When the determined result at step S135 
is No (namely, the right end of the search area has not been detected), 
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the similar process is repeated. Thus, while the block is being moved to 
the right, residuals are obtained. The minimum residual is stored as the 
minimum ADmin. 

When the determined result at step S135 is Yes (namely, the right 
5 end of the search area has been detected), the block is vertically moved 
for two pixels (at step S140). Thereafter, the flow returns to step S133. 
Thereafter, the similar process is performed. 

When the determined result at step S133 is Yes (namely, the lower 
end of the search area has been detected), the result is stored as a 
10 motion vector MV (at step S141). The motion vector MV becomes a 
reference point of the fine searching process. 
^ Fig. 24 is a flow chart showing th'e fine searching process at step 

IS ASP 

^2 S24 shown in Fig. 22. In the fine searching process, the memory that 

^ stores all pixels of the current frame and all pixels of the reference frame 

n 15 is used. 

^ In Fig. 24, the start point is set at the upper left of the reference 

point obtained at step S141 shown in Fig. 23 (at step S151). The 
Cj vertical search start position is reset to the upper end (at step S1 52). 

H Thereafter, it is determined whether or not the lower end of the search 

O 20 area has been detected (at step SI 53). When the determined result at 
step S153 is No (namely, the lower end has not been detected), the 
horizontal position is reset to the left end (at step S1 54). 

Thereafter, it is determined whether or not the right end of the 
search area has been detected (at step SI 55). When the determined 
25 result at step S1 55 is No (namely, the right end of the search area has 
not been detected), the block matching process is performed for a block 
of (16 x 16) pixels and a residual is obtained (at step S156). 

Thereafter, it is determined whether or not the value of the residual 
AD is smaller than the minimum value ADmin that has been obtained (at 
30 step S1 57). When the determined result at step S1 57 is Yes (namely, 
the value of the residual AD is smaller than the minimum value ADmin), 
the value of the residual AD is the minimum value ADmin (at step S1 58). 
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The motion vector MV is the current position. The horizontal position is 
moved for one pixel (at step S1 59) . 

When the determined result at step S1 57 is No (namely, the value 
of the residual AD is not smaller than the minimum value ADmin), the 
5 flow advances to step S159. At step S159, the horizontal position is 
moved for one pixel. Thereafter, the flow returns to step SI 55. 

At step SI 55, it is determined whether or not the right end of the 
search area has been detected. When the determined result at step S1 55 
is No (namely, the right end of the search area has not been detected), a 
10 similar process is repeated. Thus, while the block is being moved from 
the left to the right in the search area, residuals are obtained. The 
minimum residual that has been obtained is stored as the minimum value 
ADmin. 

When the determined result at step S155 is Yes (namely, the right 

1 5 end of the search area has been detected), the block is moved for one 

pixel in the vertical direction (at step S160). Thereafter, the flow returns 
to step S153. Thereafter, the similar process is performed. 

When the determined result at step S1 53 is Yes (namely, the lower 
end of the search area has been detected), the motion vector MV is 

20 obtained and the process is completed. 

In the above-described example, pixels of the reference frame and 
pixels of the current frame are extracted checkerwise. However, 
according to the present invention, the thin-out step and the thin-out 
method are not limited to those of the above-described example. 

25 In the above-described example, when the logarithmic searching 

process is performed, in the memory that stores pixels that have been 
thinned out for each sample by the coarse searching process, the address 
is moved for each position so as to search for a motion vector using two- 
pixel steps. Alternatively, by moving the address two positions at a time, 

30 a searching process with four pixels at a time can be performed. 

Likewise, by moving the address three positions at a time, a searching 
process with nine pixels at a time can be performed. In the above 
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described example, the logarithmic searching process is performed by a 
coarse searching process which proceeds two pixels a at time and a fine 
searching process using one-pixel steps. Alternatively, the logarithmic 
searching process can be performed in a plurality of stages. 

According to the present invention, pixels of a reference frame and 
pixels of the current frame are thinned out checkerwise and then a block 
matching process is performed. At this point, the pixels of the current 
frame and the pixels of the reference frame are stored as successive data 
to a memory. Thus, when the block matching process is performed, since 
an MMX instruction can be effectively used, the process can be 
performed at high speed. 

In addition, a first memory is prepared that stores pixels of a 
current frame and pixels of a reference frame, and a second memory is 
prepared that stores pixels of the current frame and pixels of the 
reference frame that have been thinned out checkerwise are prepared. 
With the second memory, a coarse searching process using two-pixel 
steps is performed. In this case, since the pixels of the current frame and 
the pixels of the reference frame that have been thinned out checkerwise 
are stored as successive data to the second memory, when the reference 
block is moved for each position in the second memory, a motion vector 
is searched using two-pixel steps. After the motion vector has been 
obtained by the coarse searching process using two-pixel steps, using the 
first memory, the fine searching process using one-pixel steps is 
performed in the vicinity of the point obtained in the coarse searching 
process. 

Thus, when pixels of the current frame and pixels of the reference 
frame that have been obtained checkerwise are stored as successive data 
to a memory, since a motion vector is searched for using two-pixel steps, 
the logarithmic searching process can be easily performed and an MMX 
instruction is available. 

The motion vector calculating process as step S2 shown in Fig. 5 is 
performed by the block matching process. In the block matching process, 
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a block with the same size and the same origin as a block divided from 
the current frame to be processed is extracted from a reference frame. 
While the block of the reference frame is being moved in a predetermined 
search area, the sum of the absolute values of the difference values 
between pixels of the block of the reference frame and pixels of the 
relevant block of the current frame is obtained as a residual. A block of 
the reference frame with the minimum residual is obtained. Thus, 
conventionally, a residual is obtained as the sum of the absolute values of 
the difference values between pixels of a block of the current frame and 
pixels of the relevant block of a reference frame. However, since the 
number of calculating steps becomes huge using such a conventional 
method, the motion vector calculating process cannot be performed at 
high speed. 

Thus, according to a fourth embodiment of the present invention, a 
residual is obtained by calculating the sum of the absolute values of the 
difference values between pixels of the contour of a block of the current 
frame and pixels of the contour of the relevant block of a reference frame. 

In other words, in Fig. 25, one block of a reference frame is 
composed of (16 x 16) pixels. Likewise, one block of a current frame is 
composed of (16 x 16) pixels. The value of each pixel of the reference 
frame is denoted by P(Hr, Vr). Likewise, the value of each pixel of the 
current frame is denoted by P(Hc, Vc). The sum of the absolute values of 
the difference values between upper contour pixels P(Hr, Vr) to P(Hr 4- 
1 5, Vr) of the block of the reference frame and upper contour pixels P(Hc, 
Vc) to P(Hc +15, Vc) of the relevant block of the current frame is 
obtained as a residual. The sum of the absolute values of the difference 
values between left contour pixels P(Hr, Vr + 1) to P(Hr, Vr + 14) of the 
block of the reference frame and left contour pixels P(Hc, Vc + 1) to 
P(Hc, Vc + 14) of the relevant block of the current frame is obtained as a 
residual. The sum of the absolute values of the difference values 
between right contour pixels P(Hr + 15, Vr + 1) to P(Hr + 15, Vr + 14) 
of the block of the reference frame and right contour pixels P(Hc + 15, 
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Vc -f 1) of the relevant block of the current frame is calculated as a 
residual. The sum of the absolute values of the difference values 
between lower contour pixels P(Hr, Vr + 15) to P(Hr + 15, Vr + 1 5) of 
the block of the reference frame and lower contour pixels P(Hc, Vc + 1 5) 
5 to P(Hc + 1 5, Vc + 1 5) of the relevant block of the current frame is 
obtained as a contour. 

Fig. 26 is a flow chart showing a process for obtaining the sum of 
the absolute values of the difference values between contour pixels of a 
block of the current frame and contour pixels of the relevant block of a 
10 reference frame so as to obtain a residual. 
n Referring to Fig. 26, the value of the cumulation value AD is 

42 initialized to "0" (at step S221). Thereafter, the horizontal position He 
M and the vertical position Vc of a pixel of the current frame and the 
p horizontal position Hr and the vertical position Vr of a pixel of the 

1 5 reference frame are initialized (at step S222). The offset O is initialized to 
"0" (at step S223). 

Cj The absolute value of the difference value between the value of the 

^ pixel P(Hr + O, Vr) of the reference frame and the value of the pixel P(Hc 
13 + O, Vc) of the current frame is obtained as a cumulation value AD (at 

20 step S224). Thereafter, the offset O is incremented (at step S225). 

Thereafter, it is determined whether or not the offset O is less than 1 6 (at 
step S226). When the determined result at step S226 is Yes (namely, 
the offset O is less than 1 6), the flow returns to step S224. 

In a loop from step S224 to step S226, the sum of the absolute 
25 value of the difference values between the upper contour pixels P(Hr, Vr) 
to P(Hr + 15, Vr) of the block of the reference frame and the upper 
contour pixels P(Hc, Vc) to P(Hc + 1 5, Vc) of the relevant block of the 
current frame is obtained. 

In other words, since the offset O has been initialized to "0" at step 
30 S223, the absolute value of the difference value between the value of the 
upper left pixel P(Hr, Vr) of the block of the reference frame and the value 
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of the upper left pixel P(Hc, Vc) of the relevant block of the current frame 
is obtained as the cumulation value AD. 

Thereafter, the offset 0 is incremented at step S225. Thus, the 
absolute value of the difference value between the value of the pixel P(Hr 
+ 1 , Vr) of the block of the reference frame and the value of the pixel 
P(Hc + 1 , Vc) of the relevant block of the current frame is obtained and 
added to the cumulation value AD. The steps in the loop are repeated 
until the offset 0 becomes "15". Thus, the sum of the absolute values of 
the difference values between the upper contour pixels P(Hr, Vr) to P(Hr 
+ 1 5, Vr) of the block of the reference frame and the upper contour 
pixels P(Hc, Vc) to P(Hc + 1 5, Vc) of the relevant block of the current 
frame is obtained. 

Thus, in the loop from step S224 to S226, the sum of the absolute 
values of the difference values between the upper contour pixels of the 
block of the current frame and the upper contour pixels of the relevant 
block of the reference frame is obtained. Thereafter, it is determined 
whether or not the offset O is "16" at step S226. When the determined 
result at step S226 is No (namely, the offset 0 is "16"), since the right 
end of the block has been detected, the offset O is initialized to "1 " (at 
step S227). 

Thereafter, the difference value between the value of the pixel 
P(Hr, Vr + O) of the block of the reference frame and the value of the 
pixel P(Hc, Vc + O) of the relevant block of the current frame is obtained. 
The difference value between the value of the pixel P(Hr + 1 5, Vr + 0) 
of the block of the reference frame and the value of the pixel P(Hc 4- 1 5, 
Vc + 0) of the relevant block of the current frame is obtained. The sum 
of these absolute values is obtained as the cumulation value AD (at step 
S228). Thereafter, the offset O is incremented (at step S229). 
Thereafter, it is determined whether or not the offset 0 is less than "15" 
(at step S230). When the determined result at step S230 is Yes (namely, 
the offset 0 is less than "15"), the flow returns to step S228. 
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In the loop from step S228 to S230, the sum of the absolute 
values of the difference values between the left contour pixels P(Hr, Vr + 
1) to P(Hr, Vr + 14) of the block of the reference frame and the left 
contour pixels P(Hc, Vc + 1 ) to P(Hc, Vc 4- 14) of the relevant block of 
5 the current frame is obtained. In addition, the sum of the absolute values 
of the difference values between the right contour pixels P(Hr + 1 5, Vr 
+ 1 ) of the block of the reference frame and the right contour pixels P(Hc 
+ 1 5, Vc + 1 ) to P(Hc + 1 5, Vc + 1 4) of the relevant block of the 
current frame is obtained. 

10 Thereafter, it is determined whether or not the offset 0 is "15" at 

step S230. When the determined result at step S230 is No (namely, the 
offset 0 is "15"), since the lower end of the block has been detected, the 
offset O is initialized to "0" (at step S231). 

Next, the absolute value of the difference value between the pixel 

1 5 P(Hr + O, Vr + 1 5) of the block of the reference frame and the pixel 
P(Hc + O, Vc + 1 5) of the relevant block of the current frame is 
obtained as the cumulation value AD (at step S232). Thereafter, the 
offset O is incremented (at step S233). Thereafter, it is determined 
whether or not the offset O is less than "16" (at step S234). When the 

20 determined result at step S234 is Yes (namely, the offset O is less than 
"16"), the flow returns to step S232. 

In the loop from step S232 to S234, the sum of the absolute 
values of the difference values between the lower contour pixels P(Hr, Vr 
+ 1 5) to P(Hr + 1 5, Vr + 1 5) of the block of the reference frame and 

25 the lower contour pixels P(Hc, Vc + 1 5) to P(Hc + 1 5, Vc + 1 5) of the 
relevant block of the current frame is obtained. When the determined 
result at step S234 is No (namely, the offset O is "16"), since the right 
end of the block has been detected, the process is completed. 

In the loop from steps S224 to S226, the sum of the absolute 

30 values of the difference values between the upper contour pixels of the 
block of the current frame and the upper contour pixels of the relevant 
block of the reference frame is obtained. In the loop from steps S228 to 
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S230, the sum of the absolute values of the difference values between 
the left contour pixels of the block of the current frame and the left 
contour pixels of the relevant block of the reference frame is obtained. In 
addition, the sum of the absolute values of the difference values between 
the right contour pixels of the block of the current frame and the right 
contour pixels of the relevant block of the reference frame is obtained. In 
the loop from steps S232 to S234, the sum of the absolute values of the 
difference values between the lower contour pixels of the block of the 
current frame and the lower contour pixels of the relevant block of the 
re f erence frame is obtained. Thus, the sum of the absolute values of the 
difference values (between the contour pixels of the four sides of the 
block of the current frame and the contour pixels of the four sides of the 
relevant block of the reference frame) is obtained. 

Thus, when the sum of the absolute values of the difference values 
between the contour pixels of a block of the current frame and the 
contour pixels of the relevant block of the reference frame is obtained, 
the number of calculating steps for obtaining a residual can be remarkably 
decreased. Consequently, the block matching process can be performed 
at high speed. In other words, when the size of a block is (1 6 x 1 6) 
pixels, to calculate all pixels of one block, 256 subtractions are required. 
In contrast, to calculate contour pixels, only 60 subtractions are required. 
In addition, since the contour pixels are not thinned out, a motion vector 
can be obtained in the accuracy of one pixel. 

Next, a picture encoding apparatus according to a fifth embodiment 
of the present invention will be described. 

Fig. 27 shows the structure of the picture encoding apparatus 401 
according to the fifth embodiment of the present invention. Referring to 
Fig. 27, the picture encoding apparatus (denoted by 401) has a frame 
buffer 202, a motion detecting portion 203, a residual information 
generating portion 204, a global vector detecting portion 205, and a 
controlling portion 206. Picture data are input to the frame buffer 202. 
The motion detecting portion 203 detects a motion component of picture 
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data stored in the frame buffer 202. The residual information generating 
portion 204 generates motion residual information AD. The global vector 
detecting portion 205 detects a motion vector of the entire picture. The 
controlling portion 206 outputs parameters and so forth for an encoding 
5 process to individual portions of the apparatus. 

The frame buffer 202 inputs picture data from an external 
apparatus and stores picture data frame by frame. The frame buffer 202 
outputs picture data to the motion detecting portion 203, the residual 
information generating portion 204, the global vector detecting portion 

10 205, and a calculating portion 207 at a predetermined timing under the 
control of the controlling portion 206. 

The global vector detecting portion 205 samples picture data 
(received from the frame buffer 202) as a plurality of macro blocks and 
detects a motion vector of the macro blocks. In other words, since the 

1 5 global vector detecting portion 205 obtains a motion vector of ail the 
macro blocks, the global vector detecting portion 205 obtains a motion 
vector of the entire picture (namely, a global vector) and supplies the 
detected global vector to the motion detecting portion 203. 

In reality, as shown in Fig. 28, the global vector detecting portion 

20 205 extracts a plurality of macro blocks at different positions of one 

picture and detects one motion vector from the extracted macro blocks. 
At this point, the global vector detecting portion 205 applies a motion 
vector from each of the extracted macro blocks to an evaluation function 
so as to obtain the global vector. The global vector detecting portion 205 

25 uses, as an evaluation function, a functional expression for calculating the 
average of motion vectors of extracted macro blocks so as to obtain the 
global vector. 

Alternatively, the global vector detecting portion 205 may extract a 
plurality of adjacent macro blocks so as to obtain a global vector. In 
30 other words, the global vector detecting portion 205 may obtain a global 
vector with each macro block of (16 x 16) pixels or with each small area 
of, for example, (32 x 32) pixels. 
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As another alternative method, as shown -in FLg. 29, the global 
vector detecting portion 205 may obtain global vectors for areas A, B, 
and C into which one screen is vertically divided. Thus, in the case that 
the area A is a picture of a mountain that has been photographed as a far- 
5 distance picture and the area C is a picture of a flower that has been 

photographed as a near-distance picture and that the area A and the area 
C have been panned, even if one screen has two pictures that move with 
respect to each other, global vectors for individual areas can be obtained. 
Each area may overlap. 

10 Referring again to Fig. 27, the motion detecting portion 203 

detects a motion vector MV of each macro block (composed of 1 6 x 1 6 
pixels) of picture data stored in the frame buffer 202. The motion 
detecting portion 203 block matches a macro block of a reference frame 
with a macro block that is read from the frame buffer 202 and detects a 

1 5 motion vector MV. The motion detecting portion 203 supplies the 

detected motion vector MV to the residual information generating portion 
204 and the controlling portion 206. At this point, the motion detecting 
portion 203 generates a motion vector MV with the global vector received 
from the global vector detecting portion 205. In other words, when the 

20 motion detecting portion 203 block matches each macro block in a 

predetermined search area, the motion detecting portion 203 varies each 
macro block in the search area with an offset of the global vector and 
obtains a motion vector MV. The motion detecting portion 203 varies the 
center position of the search area corresponding to the global vector so as 

25 to block match each macro block. 

The residual information generating portion 204 receives the motion 
vector MV from the motion detecting portion 203. In addition, the 
residual information generating portion 204 receives each macro block of 
picture data from the frame buffer 202. With the motion vector MV and 

30 picture data, the residual information generating portion 204 obtains the 
sum of the absolute values of difference values between moving 
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components as residual information AD and supplies the residual 
information AD to the controlling portion 206. 

The controlling portion 206 determines a macro block type for the 
encoding process with the motion vector MV received from the motion 
5 detecting portion 203 and the motion residual information AD received 
from the residual information generating portion 204. The controlling 
portion 206 determines whether the current macro block is an inter-macro 
block or an intra-macro block corresponding to 7 for example, the picture 
type. The inter-macro block is a macro block that is motion-compensated 
10 with a motion vector MV and encoded with a residual. In contrast, the 
n intra-macro block is a macro block that is simply encoded without motion 
^ components. 

I* The controlling portion 206 generates control information that 

?3 causes switches 217 and 218 to operate corresponding to the determined 

5 macro block type. In addition, the controlling portion 206 supplies the 
a . motion vector MV received from the motion detecting portion 203 to the 
y motion compensating portion 216. 

j!t The picture encoding apparatus 201 also has a calculating portion 

O 207, a DCT process portion 208, a quantizing process portion 209, a 

20 variable length code encoding portion 210, and a buffer 21 1. The 

calculating portion 207 receives a picture signal from the frame buffer 
202. The DCT process portion 208 performs a DCT {Discrete Cosine 
Transform) process for picture data. The quantizing process portion 209 
quantizes a DCT coefficient received fiom the DCT process portion 208. 

25 The variable length code encoding portion 210 compresses a DCT 

coefficient received from the quantizing process portion 209 with variable 
length code. The buffer 211 stores picture data received from the 
variable length code encoding portion 210. 

The DCT process portion 208 performs a two-dimensional DCT 

30 process for each block of (8 x 8) pixels of picture data received from the 
calculating portion 207. The DCT process portion 208 supplies a DCT 
coefficient to the quantizing process portion 209. 
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The quantizing process portion 209 quantizes a DCT coefficient 
received from the DCT process portion 208 with a quantizing scale that 
varies corresponding to each block. The quantizing process portion 209 
supplies the quantized DCT coefficient to the variable length code 
5 encoding portion 210 and an inversely quantizing process portion 212. 

The variable length code encoding portion 210 receives a DCT 
coefficient from the quantizing process portion 209 and a motion vector 
MV from the controlling portion 206. With such information, the variable 
length code encoding portion 210 performs an encoding process. The 
10 variable length code encoding portion 210 performs an encoding process 
n with variable length code corresponding to MPEG syntax and performs a 

42 header process, a code generating process, and so forth so as to generate 

|i picture data. The variable length code encoding portion 210 supplies the 

q encoded picture data to the buffer 211. 

15 The buffer 21 1 stores picture data received from the variable length 

* code encoding portion 210 and outputs the picture data as a bit stream at 

%i a predetermined timing under the control of the controlling portion 206. 

In addition, the picture encoding apparatus 201 has an inversely 
O quantizing process portion 212, an inversely DCT process portion 213, a 

20 calculating unit 214, a buffer 215, and a motion compensating portion 
216. The inversely quantizing process portion 21 2 inversely quantizes a 
DCT coefficient received from the quantizing process portion 209. The 
inversely DCT process portion 213 inversely performs a DCT process for a 
DCT coefficient received from the inversely quantizing process portion 
25 212. The calculating unit 214 receives picture data from the inversely 
DCT process portion 21 3. The buffer 21 5 stores picture data. The 
motion compensating portion 216 motion-compensates picture data 
received from the buffer 215. 

The inversely quantizing process portion 212 inversely quantizes a 
30 DCT coefficient received from the quantizing process portion 209. The 

inversely quantizing process portion 212 inversely quantizes data received 
from the quantizing process portion 209 with the quantizing scale thereof 
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and supplies the resultant DCT coefficient to the inversely DCT process 
portion 213. 

The inversely DCT process portion 213 inversely performs a DCT 
process for a DCT coefficient received from the inversely quantizing 
5 process portion 212 and supplies the resultant DCT coefficient to the 
calculating unit 214. The calculating unit 214 receives picture data that 
has been processed in the inversely DCT process portion 213. In 
addition, the calculating unit 214 receives picture data (that has been 
motion-compensated) through the switch 217. The calculating unit 214 
1 0 adds the motion-compensated picture data and the picture data received 
from the inversely DCT process portion 213 and supplies the resultant 
data to the buffer 215. 

The buffer 215 receives each macro block of picture data from the 
calculating unit 214 and stores the picture data. When the motion 
15 compensating portion 216 motion-compensates picture data, predictive 
picture data is read from the buffer 21 5. 

The motion compensating portion 216 reads each macro block of 
predictive picture data from the buffer 215 corresponding to a motion 
vector MV. 

20 When the picture encoding apparatus 201 generates an intra macro 

block, each macro block of picture data stored in the frame buffer 202 is 
supplied to the DCT process portion 208 and the quantizing process 
portion 209 through the calculating unit 207. The DCT process portion 
208 performs the DCT process for each macro block of the picture data. 

25 The quantizing process portion 209 quantizes the picture data received 
from the DCT process portion 208. The variable length code encoding 
portion 210 encodes the picture data received from the quantizing 
process portion 209 with variable length code and outputs the resultant 
data as a bit stream through the buffer 211. The resultant signal that has 

30 been processed by the quantizing process portion 209 and the variable 
length code encoding portion 210 is restored to picture data by the 
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inversely quantizing process portion 212 and the inversely DCT process 
portion 213 and temporarily stored to the buffer 215. 

When the picture encoding apparatus 201 generates an inter macro 
block, the motion detecting portion 203 detects a motion component of 
5 picture data stored in the frame buffer 202, so as to generate a motion 
vector MV. In addition, the residual information generating portion 204 
generates residual information AD. The motion vector MV is supplied to 
the motion compensating portion 216 through the controlling portion 206. 
The motion compensating portion 216 motion-compensates picture data 

10 stored in the buffer 21 5 (when the I picture is generated, the picture data 
is stored to the buffer 215). Thus, the motion compensating portion 216 
generates predictive data. The motion compensating portion 216 motion- 
compensates each macro block. The switches 217 and 218 are closed 
corresponding to a switch control signal received from the controlling 

1 5 portion 206. The calculating unit 207 subtracts the predictive picture 

data received from the motion compensating portion 216 from the picture 
data stored in the frame buffer 202. The DCT process portion 208 and 
the quantizing process portion 209 perform the above-described 
processes. The variable length code encoding portion 210 encodes 

20 picture data and outputs the resultant data as a bit stream through the 
buffer 211. 

Fig. 30 is a flow chart showing a process for detecting a motion 
vector MV. The process is performed by the picture encoding apparatus 
201. 

25 Referring to Fig. 30, at step S301 , picture data of one frame is 

input to the frame buffer 202. In the process shown in Fig. 30, at steps 
S302 to S304, a global vector is detected. At step S305, the motion 
detecting portion 203 generates a motion vector MV for each macro 
block. 

30 At step S302, the global vector detecting portion 205 inputs 

picture data of each frame stored in the frame buffer 202 and extracts a 
plurality of macro blocks from the picture data, as shown in Fig. 28. At 
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step S302, as shown in Fig. 29, one screen may be divided into a 
plurality of areas and a plurality of macro blocks may be extracted 
therefrom. 

At step S303, the global vector detecting portion 205 detects a 
motion vector of each macro block detected at step S302. 

At step S304, the global vector detecting portion 205 applies a 
motion vector of each macro block to an evaluation function so as to 
generate a global vector. The global vector detecting portion 205 
calculates the average of motion vectors of macro blocks and generates a 
global vector. 

At step S305, the motion detecting portion 203 receives each 
macro block of picture data, block matcKes each macro block with the 
global vector detected at step S304, and detects a motion vector of each 
macro block. At this point, the motion detecting portion 203 varies the 
center position of a search area corresponding to the global vector and 
block matches each macro block. 

At step S306, the motion detecting portion 203 detects a motion 
vector MV of each macro block corresponding to the detected result at 
step S305 and supplies the motion vector MV of each macro block to the 
residual information generating portion 204 and the controlling portion 
206. 

In the picture encoding apparatus 201, before the motion detecting 
portion 203 generates a motion vector MV of each macro block, the 
global vector detecting portion 205 detects a global vector that 
represents one motion vector of the entire picture. Thus, the motion 
detecting portion 203 does not need to detect a motion vector MV of 
each macro block in a wide area. Consequently, the process for detecting 
a motion vector MV can be performed with a reduced number of 
calculating steps. In other words, in the picture encoding apparatus 201, 
even if a picture that is moving is panned, it is not necessary to cause the 
global vector detecting portion 205 to obtain a global vector of the entire 
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picture and to. detect a motion vector of each macro block in a wide 
search area. 

In addition, using the picture encoding apparatus 201, even if a 
picture moves at high speed on the entire screen, a motion vector of each 
5 macro block can be easily detected. 

Moreover, using the picture encoding apparatus 201, one screen 
may be divided into a plurality of areas. The global vector detecting 
portion 5 calculates a global vector of each area. Thus, even if a picture 
moves a lot on the screen, a motion vector MV can be effectively 
10 detected. 

Next, a sixth embodiment of the present invention will be 
described. 

Fig. 31 is a block diagram showing the structure of a picture 
encoding apparatus according to the sixth embodiment of the present 
15 invention. 

Fig. 31 shows the structure of the picture encoding apparatus 
according to the sixth embodiment of the present invention. Referring to 
Fig. 31, the picture encoding apparatus {denoted by 301) has a frame 
buffer 302, a motion detecting portion 303, a residual information 

20 generating portion 304 and a controlling portion 305. Picture data are 

input to the frame buffer 302. The motion detecting portion 303 detects 
a motion component of picture data stored in the frame buffer 302. The 
residual information generating portion 304 generates motion residual 
information AD. The controlling portion 306 outputs parameters and so 

25 forth for an encoding process to individual portions of the apparatus. 
The frame buffer 302 inputs picture data from an external 
apparatus and stores picture data frame by frame. The frame buffer 302 
outputs picture data to the motion detecting portion 303, the residual 
information generating portion 304 and a calculating portion 307 at a 

30 predetermined timing under the control of the controlling portion 305. 

The motion detecting portion 303 detects a motion vector MV of 
each macro block (composed of 1 6 x 1 6 pixels) of picture data stored in 



- 58 - 



the frame buffer 302. The motion detecting portion 303 block matches a 
macro block of a reference frame with a macro block that is read from the 
frame buffer 302 and detects a motion vector MV. The motion detecting 
portion 303 supplies the detected motion vector MV to the residual 
information generating portion 304 and the controlling portion 305. 

The residual information generating portion 304 receives the motion 
vector MV from the motion detecting portion 303. In addition, the 
residual information generating portion 304 receives each macro block of 
picture data from the frame buffer 302. With the motion vector MV and 
picture data, the residual information generating portion 304 obtains the 
sum of the absolute values of difference values between moving 
components as residual information AD and supplies the residual 
information AD to the controlling portion 305 and a skip controlling 
portion 310. 

The controlling portion 305 determines a macro block type for the 
encoding process with the motion vector MV received from the motion 
detecting portion 303 and the motion residual information AD received 
from the residual information generating portion 304. The controlling 
portion 305 determines whether the current macro block is an inter-macro 
block or an intra-macro block corresponding to, for example, the picture 
type. The inter-macro block is a macro block that is motion-compensated 
with a motion vector MV and encoded with a residual. In contrast, the 
intra-macro block is a macro block that is simply encoded without moving 
components. 

The controlling portion 305 generates control information that 
causes switches 317 and 318 to operate corresponding to the determined 
macro block type. In addition, the controlling portion 305 supplies the 
motion vector MV received from the motion detecting portion 303 to the 
motion compensating portion 316. 

The picture encoding apparatus 301 also has a calculating portion 
306, a DCT process portion 307, a quantizing process portion 308, a 
variable length code encoding portion 309, the above-mentioned skip 
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controlling portion 310, and a buffer 311. The calculating portion 306 
receives a picture signal from the frame buffer 302. The DCT process 
portion 307 performs a DCT (Discrete Cosine Transform) process for 
picture data. The quantizing process portion 308 quantizes a DCT 
coefficient received from the DCT process portion 307. The variable 
length code encoding portion 309 compresses a DCT coefficient received 
from the quantizing process portion 308 with variable length code. The 
skip controlling portion 310 controls the DCT process portion 307, the 
quantizing process portion 308, the variable length code encoding portion 
309, and so forth. The buffer 31 1 stores picture data that have been 
encoded. 

The DCT process portion 307 performs a two-dimensional DCT 
process for each block of (8 x 8) pixels of picture data received from the 
calculating portion 306. The DCT process portion 307 supplies a DCT 
coefficient to the quantizing process portion 308. 

The quantizing process portion 308 quantizes a DCT coefficient 
received from the DCT process portion 307 with a quantizing scale that 
varies corresponding to each macro block. The quantizing process portion 
308 supplies the quantized DCT coefficient to the variable length code 
encoding portion 309 and an inversely quantizing process portion 312. In 
addition to the quantizing process, the quantizing process portion 308 
generates a CBP (Coded Block Pattern). When the quantizing process 
portion 308 generates the CBP, it supplies information that represents the 
CBP to the variable length code encoding portion 309. 

The skip controlling portion 310 generates a skip control signal that 
causes the DCT process portion 307 and the quantizing process portion 
308 to skip the DCT process and the quantizing process corresponding to 
the motion residual information received from the residual information 
generating portion 304. The skip controlling portion 310 receives motion 
residual information AD from the motion detecting portion 303, predicts 
the value of the CBP with the motion residual information AD, and sets 
the DCT coefficient to "0" corresponding to the value of the CBP. When 
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the skip controlling portion 31 Onsets the DCT coefficient to "0", the skip 
controlling portion 310 supplies the skip control signal to the motion 
compensating portion 316, the DCT process portion 307, the quantizing 
process portion 308, and the variable length code encoding portion 309. 
5 Thus, when the skip controlling portion 310 sets the value of the CBP 
(namely, the DCT coefficient) to "0", it causes such portions to skip their 
processes. 

When the skip controlling portion 310 sets the DCT coefficient to 
"0", the skip controlling portion 310 compares the motion residual 

1 0 information AD received from the residual information generating portion 
304 with a predetermined value. The predetermined value is designated 
by, for example, the user. In other worcfs, when the motion residual 
information AD is smaller than the predetermined value, the skip 
controlling portion 310 determines that the value of the CBP is small and 

1 5 supplies a skip control signal (that substitutes "0" to the DCT coefficient) 
to the above-described portions. In contrast, when the motion residual 
information AD is not smaller than the predetermined value, the skip 
controlling portion 310 does not generate the skip control signal. 

Alternatively, the skip controlling portion 310 may determine the 

20 predetermined value with information obtained in the encoding process 
(the information is such as the bit rate of the variable length code 
encoding process of the variable length code encoding portion 309 and 
the quantizing scale of the quantizing process of the quantizing process 
portion 308) and compares the predetermined value with the motion 

25 residual information AD. At this point, the skip controlling portion 310 
compares the motion residual information for each macro block with the 
predetermined value and generates the skip control signal corresponding 
to the compared result. 

As another alternative method, the skip controlling portion 310 may 

30 use the mean value MAD of the motion residual information AD of each 
macro block instead of the motion residual information AD. In this case, 
the mean value MAD is generated by the motion compensating portion 
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316. The skip controlling portion 310 receives the mean value MAD from 
the motion compensating portion 316 and sets the DCT coefficient to M 0" 
corresponding to the mean value MAD. The detailed operation of the skip 
controlling portion 310 will be described later. 
5 The variable length code encoding portion 309 receives a DCT 

coefficient from the quantizing process portion 308 and a motion vector 
MV from the controlling portion 305. With such information, the variable 
length code encoding portion 309 performs an encoding process. The 
variable length code encoding portion 309 performs an encoding process 
10 with variable length code corresponding to MPEG syntax and performs a 
header process, a code generating process, and so forth so as to generate 
picture data. The variable length code encoding portion 309 supplies the 
encoded picture data to the buffer 311. 

The variable length code encoding portion 309 receives information 
1 5 that represents that the value of the CBP is "0" from the quantizing 

process portion 308. When there is no motion vector MV, the variable 
length code encoding portion 309 may skip a macro block corresponding 
to the macro block type. 

The buffer 31 1 stores picture data received from the variable length 
20 code encoding portion 309 and outputs the picture data as a bit stream at 
a predetermined timing under the control of the controlling portion 305. 

In addition, the picture encoding apparatus 301 also has an 
inversely quantizing process portion 312, an inversely DCT process 
portion 313, a calculating unit 314, a buffer 315, and a motion 
25 compensating portion 316. The inversely quantizing process portion 312 
inversely quantizes a DCT coefficient received from the quantizing 
process portion 308. The inversely DCT process portion 313 inversely 
performs a DCT process for a DCT coefficient received from the inversely 
quantizing process portion 312. The calculating unit 314 receives picture 
30 data from the inversely DCT process portion 31 3. The buffer 31 5 stores 
picture data. The motion compensating portion 316 motion-compensates 
picture data received from the buffer 315. 
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The inversely quantizing process portion 312 inversely quantizes a 
DCT coefficient received from the quantizing process portion 308. The 
inversely quantizing process portion 312 inversely quantizes data received 
from the quantizing process portion 308 with the quantizing scale thereof 
5 and supplies the resultant DCT coefficient to the inversely DCT process 
portion 313. 

The inversely DCT process portion 313 inversely performs a DCT 
process for a DCT coefficient received from the inversely quantizing 
process portion 312 and supplies the resultant DCT coefficient to the 

10 calculating unit 314. The calculating unit 314 receives picture data that 
have been processed in the inversely DCT process portion 313. In 
addition, the calculating unit 314 receives picture data that have been 
motion-compensated through the switch 317. The calculating unit 314 
adds the motion-compensated picture data and the picture data received 

15 from the inversely DCT process portion 313 and supplies the resultant 
data to the buffer 315. 

The buffer 315 receives picture data from the calculating unit 314 
and stores the picture data. When the motion compensating portion 316 
motion-compensates picture data, predictive picture data are read from 

20 the buffer 315. 

The motion compensating portion 316 reads each macro block of 
predictive picture data from the buffer 315 corresponding to a motion 
vector MV. The motion compensating portion 316 supplies the motion 
vector MV received from the controlling portion 305 to the calculating 

25 portion 306 corresponding to the predictive picture data. 

When the picture encoding apparatus 301 generates an I (Intra) 
macro block, each macro block of picture data stored in the frame buffer 
302 is supplied to the DCT process portion 307 and the quantizing 
process portion 308 through the calculating unit 306. The DCT process 

30 portion 307 performs the DCT process for each macro block of the 

picture data. The quantizing process portion 308 quantizes the picture 
data received from the DCT process portion 307. The variable length 
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code encoding portion 309 encodes the picture data-received from the 
quantizing process portion 308 with variable length code and outputs the 
resultant data as a bit stream through the buffer 311. The resultant 
signal that has been processed by the quantizing process portion 308 and 
5 the variable length code encoding portion 309 is restored to picture data 
by the inversely quantizing process portion 312 and the inversely DCT 
process portion 313 and temporarily stored to the buffer 315. 

When the picture encoding apparatus 301 generates an inter macro 
block, the motion detecting portion 303 detects a motion component of 
10 picture data stored in the frame buffer 302 so as to generate a motion 
vector MV. In addition, the residual information generating portion 304 
generates residual information AD. The motion vector MV is supplied to 
the motion compensating portion 316 through the controlling portion 305. 
The motion compensating portion 316 motion-compensates picture data 
1 5 stored in the buffer 315 (when the I picture is generated, the picture data 
are stored to the buffer 315). Thus, the motion compensating portion 
316 generates predictive data. The motion compensating portion 316 
motion-compensates each macro block. The switches 317 and 318 are 
closed corresponding to a switch control signal received from the 
20 controlling portion 305. The calculating unit 306 subtracts the predictive 
picture data received from the motion compensating portion 316 from the 
picture data stored in the frame buffer 302. The DCT process portion 
307 and the quantizing process portion 308 perform the above-described 
processes. The variable length code encoding portion 309 encodes 
25 picture data and outputs the resultant data as a bit stream through the 
buffer 311. 

Fig. 32 is a flow chart showing an encoding process of the picture 
encoding apparatus 301 . In the flow chart shown in Fig. 32, a motion 
vector MV is detected from picture data stored in the frame buffer 302. 
30 An inter macro block or an intra macro block is generated corresponding 
to the detected motion vector MV. 
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At step S401 , picture data of -one frame are input to the frame 
buffer 302. 

At step S402, the motion detecting portion 303 detects the motion 
of the picture data stored in the frame buffer 302 and detects the motion 
5 as a motion vector MV. The residual information generating portion 304 
generates motion residual information AD with the motion vector MV. 
The motion vector MV and the motion residual information AD are 
supplied to the controlling portion 305 and the skip controlling portion 
310, respectively. 

10 At step S403, the skip controlling portion 310 compares the 

motion residual information AD received from the residual information 

generating portion 304 with a predetermined value. When the determined 

result at step S403 is No (namely, the motion residual information AD is 

not less than the predetermined value), the flow advances to step S404. 
1 5 In contrast, when the determined result at step S403 is Yes (namely, the 

motion residual information AD is less than the predetermined value), the 

flow advances to step S407. 

At step S404, with the motion vector MV received from the 

controlling portion 305, the predictive picture data are generated and 
20 supplied to the calculating portion 306. Corresponding to the calculated 

result of the calculating unit 306, the motion compensating portion 316 

compensates the motion of the picture data. 

At step S405, the calculating unit 306 subtracts the predictive 

picture data that has been motion-compensated at step S404 from the 
25 picture data received from the frame buffer 302. The DCT process 

portion 307 performs the DCT process for the picture data received from 

the calculating unit 306. 

At step S406, the quantizing process portion 308 quantizes the 

DCT coefficient generated by the DCT process portion 307 at step S405. 
30 In contrast, when the determined result at step S403 is Yes 

(namely, the motion residual information AD is less than the 

predetermined value), the flow advances to step S407. At step S407, 
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the skip controlling portion 310 generates the skip control signal that 
causes the DCT process portion 307, the quantizing process portion 308, 
the variable length code encoding portion 309, and the motion 
compensating portion 316 to skip their processes and supplies the skip 
5 control signal to these portions. In other words, picture data stored in the 
buffer 315 is supplied to the variable length code encoding portion 309. 

At step S408, the variable length code encoding portion 309 
determines whether or not the value of the CBP is "0". When "0" has 
been set to the DCT coefficient at step S407, the variable length code 

10 encoding portion 309 determines that the value of the CBP is "0". 

At step S409, the variable length code encoding portion 309 
performs, for example, a header process'and a variable length code 
generating process and outputs the encoded picture data as a bit stream 
through the buffer 311. A macro block of which the value of the CBP is 

15 "0," determined at step S408, are data composed of header, MB 

increment, CBP, vector, and so forth rather than data of (16 x 16) pixels. 
In other words, when a macro block of which the value of the CBP is "0" 
is detected, the macro block is output as the same picture as the 
preceding picture. 

20 In the process of the picture encoding apparatus 301 (see Fig. 32), 

the predetermined value and the motion residual information AD are 
compared so as to determine whether or not the DCT coefficient is "0". 
Alternatively, instead of the predetermined value, it can be determined 
whether or not the DCT coefficient is "0" corresponding to an evaluation 

25 function with a quantizing scale received from the quantizing process 
portion 308, an occupation amount of picture data stored in the buffer 
315, a bit rate, and so forth. 

Since the picture encoding apparatus 301 has the skip controlling 
portion 310 that skips the motion compensating process, the DCT 

30 process, and the quantizing process corresponding to the compared result 
of which the motion residual information AD has been compared with the 
predetermined value, the process time necessary for the encoding process 
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for picture data whose DCT coefficient finally becomes "0" can be 
shortened. 

In addition, according to the picture encoding apparatus 301, the 
process time for the DCT process and the quantizing process in a real 
time encoder and so forth is not strict. Thus, the picture encoding 
apparatus 301 can be easily designed and power consumption thereof 
can be reduced. 

In addition, according to the picture encoding apparatus 301, even 
if the encoding process is performed by software, the load of the process 
of for example a CPU (Central Processing Unit) can be reduced. 

According to the present invention, when a motion vector is 
obtained, a block of a reference frame and a block of the current frame 
are orthogonally transformed into frequency data. When picture data are 
transformed into frequency data and a residual between the block of the 
reference frame and the block of the current frame is obtained, the 
number of calculating steps is remarkably decreased. Thus, the process 
can be performed at high speed. Consequently, the process can be 
sufficiently performed by software. 

According to the present invention, in a loop of a block matching 
calculation, the sum of the absolute values of the difference values 
between pixels of a block of the current frame and pixels of the relevant 
block of the reference frame is compared with a predetermined threshold 
value. When the sum exceeds the threshold value, the process is 
stopped. Thus, since the number of calculating steps is deceased, a 
motion vector can be searched at high speed. 

The initial threshold value is set corresponding to the value of the 
sum of mean discrete absolute values (MAD) and the residual AD (0, 0) at 
the origin. In the case that the threshold value is set corresponding to the 
sum MAD and the residual AD (0, 0) at the origin, when the inter-frame 
encoding process is performed, a motion vector can be reliably detected. 
In contrast, when the intra-frame encoding process is performed, the 
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process is stopped. Thus, the block matching process can be effectively 
performed. 

In addition, when a motion vector is searched for the first time, the 
threshold value obtained corresponding to the sum MAD and the residual 
5 AD (0, 0) at the origin is used. As the threshold value for the block 
matching process performed a second time for the same block, the 
original threshold value or the detected sum is used, whichever is smaller. 
In other words, since the threshold value that is the minimum value that 
has been obtained so far is used, a motion vector can be effectively 
10 detected. 

As described above, according to the motion detecting apparatus 
W and the motion detecting method of the present invention, a plurality of 

macro blocks are extracted from a picture. A motion vector of the 
?5 extracted macro blocks is detected. With the detected motion vector, a 

HM 5 motion vector of the entire picture is calculated. With the motion vector 
a of the entire picture, a motion vector of each macro block is calculated. 

Cf Thus, before a motion vector of each macro block is calculated, a motion 

Vi vector of the entire picture can be obtained. Consequently, according to 

C3 the motion detecting apparatus and the motion detecting method of the 

20 present invention, a motion vector of a picture that moves a lot on the 
entire screen can be easily detected. In addition, the number of 
calculating steps for the process for detecting the motion of a picture can 
be remarkably decreased. 

As described above, according to the picture encoding apparatus 
25 and the picture encoding method of the present invention, a motion 
vector of a pixel block of picture data is detected and motion residual 
information thereof is generated. The motion residual information is 
compared with a predetermined value. A predetermined process 
necessary for an encoding process is performed for picture data. 
30 Corresponding to the determined result, a predetermined process of a 

picture data process means is skipped. The process time for the encoding 
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process for a picture whose DCT coefficient finally becomes "0" can be 
shortened. 

According to the present invention, when a motion vector is 
obtained by a block matching process, a residual is obtained with the sum 
5 of the absolute values of the difference values between contour pixels of 
a block of a reference frame and contour pixels of the relevant block of 
the current frame. Thus, the number of calculating steps is decreased. 
Consequently, the process can be performed at high speed. Since the 
sum of the absolute values of the difference values between all contour 
10 pixels of a block of the reference frame and all contour pixels of the 
q relevant block of the current frame is obtained, a motion vector can be 
^ accurately detected. 

H Although the present invention has been shown and described with 

p respect to a best mode embodiment thereof, it should be understood by 

5 those skilled in the art that the foregoing and various other changes, 
I omissions and additions in the form and detail thereof may be made 

therein without departing from the spirit and scope of the present 
;^ invention. 
Q 
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